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Abstract. We describe recent work of Klyachko, Totaro, Knutson, and Tao, that 
characterizes eigenvalues of sums of Hermitian matrices, and decomposition of tensor 
products of representations of GLn(C). We explain related applications to invariant 
factors of products of matrices, intersections in Grassmann varieties, and singular 
values of sums and products of arbitrary matrices. 
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Recent breakthroughs, primarily by A. Klyachko, B. Totaro, A. Knutson, and 
T. Tao, with contributions by P. Belkale and C. Woodward, have led to complete 
solutions of several old problems involving the various notions in the title. Our aim 
here is to describe this work, and especially to show how these solutions are derived 
from it. Along the way, we will see that these problems are also related to other 
areas of mathematics, including geometric invariant theory, symplectic geometry, 
and combinatorics. In addition, we present some related applications to singular 
values of arbitrary matrices. 

Although many of the theorems we state here have not appeared elsewhere, their 
proofs are mostly "soft" algebra based on the hard geometric or combinatorial work 
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of others. Indeed, this paper emphasizes concrete elementary arguments. We do 
give some new examples and counterexamples, and raise some new open questions. 

We have attempted to point to the sources and to some of the key partial re- 
sults that had been conjectured or proved before. However, there is a very large 
literature, particularly for linear algebra problems about eigenvalues, singular val- 
ues, and invariant factors. We have listed only a few of these articles, from whose 
bibliographies, we hope, an interested reader can trace the history; we apologize to 
the many whose work is not cited directly. 

We begin in the first five sections by describing each of the problems, together 
with some of their early histories, and we state as theorems the new solutions to 
these problems. In Section 6 we describe the steps toward these solutions that were 
carried out before the recent breakthroughs. Then we discuss the recent solutions, 
and explain how these theorems follow from the work of the above mathematicians. 
Sections 7, 8, 9, and 10 also contain variations and generalizations of some of the 
theorems stated in the first five sections, as well as attributions of the theorems to 
their authors. 

One of our fascinations with this subject, even now that we have proofs of the 
theorems, is the challenge to understand in a deeper way why all these subjects are 
so closely related. It is a particular challenge in each of these areas to understand 
why the solutions can be described inductively. 

I am grateful to many people for advice and help in preparing this article, espe- 
cially: P. Belkale, A. Buch, L. Chen, J. Day, P. Diaconis, A. Knutson, R. Lazarsfeld, 
C. K. Li, A. Okounkov, Y. T. Poon, P. Pragacz, J. F. Queiro, P. Sarnak, F. Sot- 
tile, R. Steinberg, J. Stembridge, T. Y. Tam, H. Tamvakis, T. Tao, B. Totaro, C. 
Woodward, A. Zelevinsky, and a referee. 

1. Eigenvalues of sums of Hermitian and real symmetric matrices 

The first problem goes back to the nineteenth century: What can be said about 
the eigenvalues of a sum of two Hermitian (or real symmetric) matrices, in terms 
of the eigenvalues of the summands? 

It is a basic fact of linear algebra that all of the eigenvalues of any Hermitian or 
real symmetric matrix are real. We consider n by n matrices, with n fixed. If A is 
a real symmetric matrix, its eigenvalues describe the quadratic form — x^Ax 

in an appropriate orthogonal coordinate system. For example, if the eigenvalues 
are positive, the inverses of the square roots of the eigenvalues are half the lengths 
of the principal axes of the ellipsoid qA(x) — 1. 

We always list the n eigenvalues of such a matrix in decreasing order, including 
any eigenvalue as often as its multiplicity. If A, B, and C are Hermitian n by n 
matrices, we denote the eigenvalues of A by 

a : ai > Q!2 > . . . > ttn, 

and similarly write (3 and 7 for the eigenvalues (spectra) of B and C. The question 
becomes: 

What a, (3, and 7 can be the eigenvalues of n by n Hermitian (or real 
symmetric) matrices A, B, and C, with C = A + B7 

For example, one can fix A and take B small, thus regarding C as a deformation of 
A. In the real symmetric case, one is then asking how the shape of the hypersurface 
qc{x) = 1 compares with that of qA{x) = 1. 
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One can, if one wishes, take A = D{a) to be the diagonal matrix with entries 
Q!i, . . . , a„ down the diagonal, and B = U D{f})U* , with U a unitary (or orthogonal 
in the real symmetric case) matrix. We are looking for the eigenvalues of D{a) + 
U D{(3) U* as U varies over the unitary group U (n). This was the approach in much 
of the classical work on the problem, but it will not play an important role here. 

There is one obvious necessary condition, that the trace of C be the sum of the 
traces of A and B: 

n n n 

(*) 1^7i = ^Ci + ^Pi. 

i=l i=l i=l 

There is a long history of results that put additional necessary conditions on the 
possible eigenvalues. The first goes back more than a century, and is a reasonable 
exercise for a linear algebra class: 

(1) 71 < ai+(3i. 

The first significant result was given in 1912 by H. Weyl [W]: 

(2) 7i+i-i < Q^i + Pj whenever i + j — 1 < n. 

Here is a typical application of these inequalities. If A and B are Hermitian n 
by n matrices that differ by a matrix of rank at most r, then their eigenvalues a 
and p satisfy the inequalities 

cuk+r < Pk and /3k+r < ctk for 1 < A:, k + r < n. 

Indeed, these inequalities follow by applying (2) to the triples {B,A — B,A) and 
{A, B — A,B), with j — r + 1. When r = 1, this is an interlacing theorem: between 
any two odd numbered (or even numbered) eigenvalues of A there is at least one 
eigenvalue of B. There are similar results for eigenvalues of principal minors of A, 
which we describe in Section 6. 

In fact, for n = 2, it is not hard to verify directly that the conditions (*), (1), 
and (2), which say 

7i + 72 = ai + Q!2 + /3i + 02 

7i < ai + /?!, 72 < Q!2 + /3i, and 72 < ai + p2 

are both necessary and sufficient for the existence of 2 by 2 Hermitian (or real 

symmetric) matrices with these eigenvalues. 

In 1949 K. Fan [F] found some other necessary conditions: 

r r r 

(3) X] ^' - X! + X! r <n. 

i=l i=l i=l 

This question was featured in Gel'fand's seminar in Moscow. In 1950 V. B. 
Lidskii [LI], cf. [BG], found the following necessary condition. For this, regard an 
n-tuple a of eigenvalues as a point (ai, . . . , a„) in K". This result asserts that the 
point 7 must be in the convex hull of the points a + Per, where a varies over the 
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symmetric group Sn, and I3„ denotes (/3cr(i), ■ • ■ ,P(T{n))- Although this looks quite 
different from (1) - (3), H. Wielandt [Wi] showed that this geometric condition is 
equivalent to the inequalities: 

(4) E^^^E"^+E/5' 

ie/ i€l 4=1 

for every subset / of {1, . . . , n} of cardinality r, for all r < n. Of course, the same 
inequalities are valid when the roles of a and /3 are interchanged, and we include 
them in the list we are making. In 1962, A. Horn [H2] showed that the inequalities 
listed so far, with the addition of 

(5) 72 + 73 < CKl + Q!3 + /?! + 133, 

arc necessary and sufficient for the existence of 3 by 3 Hermitian matrices with 
these eigenvalues. 

Other inequalities were found, all having the form 

{*ijk) '^7k < E'^*+E^-?' 

keK iei jeJ 

for certain subsets I, J, K of {1, ... ,n} of the same cardinality r, with r <n. We 
always write the subsets in increasing order, so 

I = {ii < i2 < ■ ■ ■ < ir}, 

J = {ji < . . . < jr}, and K = {fci < . . . < kr}. 

For example, there is a result of L. Frccdc and R. C. Thompson [TF], which 
generalizes the Weyl, Fan, and Lidskii- Wielandt inequalities: 

(6) E ^'^ - E + E ^J' kp = ip + jp - p for 1 < p < r. 

keK iei jeJ 

One has these inequalities for any / and J, provided that ir + jr < n + r. 

Note that any inequality of the form {*ijk) can be subtracted from the equality 
(*) to give an inequality 

(7) E ^^^E"' + E/^^' 

keK': iei'' jeJ'^ 

where 7°, and K'^ are the complements of 7, J, and K in {1, . . . ,n}. We do 
not list any of these. 

In his remarkable paper [H2], Horn undertook a systematic study of such in- 
equalities. In fact, he prescribed sets of triples (7, J, 7C), and he conjectured that 
the inequalities {*ijk) for these triples would give both necessary and sufficient 
conditions for a triple (a, /?, 7) to arise as eigenvalues of Hermitian matrices A, B, 
and C with C = A + B. 

Horn defined sets T" of triples {I,J,K) of subsets of {!,... ,n} of the same 
cardinality r, by the following inductive procedure. Set 

(8) u: = {{I,J,K)\Y,i + Y,3=Y,k + r{r+ l)/2}. 

iei jeJ keK 
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All the triples that we have listed arc in U^. When r = 1, set T^" = U^. (The 
inequalities specified by (/, J,K) in are Weyl's inequalities (2).) In general, 

T;* = {(7, J, K) G I for all p < r and all {F, G, H) in T;, 

/eF geG /teJ? 

Horn's Conjecture. A triple (a,/?, 7) occurs as eigenvalues of Hermitian n by 
n matrices A, B, C with C = A + B if and only ifj^li = X] + X] A o,nd the 
inequalities {*ijk) hold for every {I, J, K) in T", for all r < n. 

B.V. Lidskii, son of V. B. Lidskii, announced a proof of Horn's conjecture in 
1982 [L2], but no details of this have ever appeared. 

Theorem 1. Horn's conjecture is true. 

In fact, from the recent work to be discussed below, the surprising inductive 
form of this eigenvalue problem can be given in another way. To state this we use a 
standard correspondence between finite sets I = {i\ < 12 < ■ ■ ■ < ir} of r positive 
integers, and partitions A = (Ai > A2 > • • • > \) of nonnegative integers of length 
at most r. This correspondence is obtained by defining 

(10) A = A(/) = (v-r,v_i-(r-l),... ,i2-2,h-l). 

We let J correspond to /it, and K correspond to u, by this same recipe. When we 
say (A, /i, jy) corresponds to (/, J, K), we mean (A, /i, = (A(/), A( J), X{K)). (To 
recover the subsets from the partitions, the integer r must be specified, but this 
will always be the case for us.) 

If {I,J,K) corresponds to (A, /i, z/), note that {I,J,K) is in J7" exactly when 
J2'^i = S + X] Mi- Note also that each of A, fx, and is a sequence of r real num- 
bers listed in decreasing order, so it makes sense to ask if they arise as eigenvalues 
of a triple of Hermitian r by r matrices. 

Theorem 2. A triple (/, J, K) is in T" if and only if the corresponding triple 
(A, fi, u) occurs as eigenvalues of a triple of Hermitian r by r matrices, with the 
third the sum of the first two. 

For example, one can use this theorem to find (all) such triples with r = n — 1. 
They consist of complements of integers i,j, and k with i + j = n + k. The triples 
(A, /X, I/) corresponding to these subsets consist of partitions with only I's and O's, 
with n — i, n—j, and n — fc I's in A, /z, and v respectively. These occur as eigenvalues 
of diagonal matrices of size n— 1 with only I's and O's as entries; the first matrix has 
n — i I's at the beginning of the diagonal, the second has n — j I's at the end, and 
their sum has a total of n — fc I's at the ends. Prom (7) this gives the inequalities 

(11) 7fc > + /3j if i+j = n + k. 

For another example, the triple (/, J, K) = ({1, 3, 5}, {1, 3, 5}, {2, 4, 6}) is in Tl. 
In this case the corresponding triple (A, /z, i^) = ((2, 1, 0), (2, 1, 0), (3, 2, 1)) arises 
from the triple of diagonal 3 by 3 matrices with diagonal entries (2, 1,0), (1,0,2), 
and (3,1,2). 
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We will give several other characterizations of the sets T" in terms of Littlewood- 
Richardson coefficients (see Theorems 11 and 12) in Section 3. 

Horn's paper also contains results and examples about the case of real symmetric 
matrices, but he did not explicitly make his conjecture for this case. In fact, some 
of the methods that were used to prove inequalities broke down in the real case 
(see Example 1), so there was some doubt whether the real symmetric and complex 
Hermitian cases would coincide; a positive answer, however, was conjectured in 
[DST]. Theorems 1 and 2 arc indeed true in the real symmetric case: 

Theorem 3. A triple (a, /3,7) occurs as eigenvalues for a triple of real symmetric 
matrices if and only if it occurs for a triple of complex Hermitian matrices. 

There have also been a few cases of inequalites {*ijk) proved for Hermitian (self- 
conjugate) quaternionic matrices (e.g. [X]). The complete answer, as suggested by 
Steinberg and Totaro, is the same as in the real and complex situations: 

Theorem 4. A triple {a, (3,^) occurs as eigenvalues for a triple of quaternionic 
Hermitian matrices if and only if it occurs for a triple of complex Hermitian ma- 
trices. 

V. B. Lidskii and Horn investigated the situation when some inequality {*ijk) 
becomes an equality, and they stated and proved special cases of the following 

theorem. 

Theorem 5. // Hermitian matrices A, B, and C — A + B have eigenvalues a, (3, 
and 7, and some inequality {*ijk) occurs with equality for {I,J,K) in T", then 
there is an r-dimensional subspace L of C" such that A, B, and C map L into 
itself. 



block diagonal form ( ,1^ ) with P r by r and Qn — rhyn — r Hermitian matrices. 



This means that, after a unitary change of coordinates, the three matrices have a 

p ' 



Example 1. The assertion of Theorem 5 is not true for real symmetric matrices 
and real subspaces of R", although it is true for n < 5. Here is an explicit example 
with n = 6. Take A to be any diagonal matrix with entries {x,x,y,y, z, z) down 
the diagonal, with x, y, and z distinct real numbers that sum to zero. Take B to 
be the matrix 

35 
19 

51 


2 / 

which has eigenvalues (56, 56, 28, 28, -84, -84). Then C = A + B will also have 
eigenvalues consisting of three pairs of distinct numbers that sum to zero. The 
inequality {*ijk), with I = J = {1,3,5}, and K = {2,4,6} is an equality, since 
ai + az + ct^ = 0, /3i + /33 + /35 = 0, and 72 + 74+76 = 0. There are exactly 
two 3-dimcnsional subspaces preserved by A and B (and therefore C), and both 
are complex; letting ei, . . . ,e% be the standard basis for C", they are the span L 
of ei + ie^, 63 + ie^, and 65 + zee, and its orthogonal subspace Z/"*", spanned by 
ei — ie2, 63 — ^65, and 65 — ie^. (The fact that C preserves these subspaces implies 





15 





-32 


-3 


35 





15 


3 


-32 


3 




-32 


3 


-17 





51 




-3 


-32 





-17 


-19 




35 


3 


51 


-19 


2 


V 


-3 


35 


19 


51 






EIGENVALUES AND SCHUBERT CALCULUS 



7 



that its eigenvalues come in pairs as asserted.) We will see some explanation for 
this example later (cf. [TT2]). 

It should be understood that one is looking for a minimal, or at least a small 
set of inequalities {*i.jk)- Any such inequality determines many others, and one 
usually does not want to list these trivial consequences. For example, from (5) 
one can deduce immediately that 72+74 < Qi + as + /3i + /32, simply because 
74 < 73 and /Sa < /?2. As A. Buch points out, there is a special situation for n = 2, 
as the inequalities ai > 0:2, /3i > (^2, and 71 > 72 follow from the equality (*) 
and the three inequalities {*ijk)- With this exception, the inequalities in Horn's 
conjecture are in fact minimal for n < 5, which is the region that Horn investigated 
most thoroughly. We will see, however, that they are not minimal for larger n, 
although that was thought to be the case until quite recently. 

Although the triple (/, J, K) = ({1, 3, 5}, {1, 3, 5}, {2, 4, 6}) is on Horn's list, as 
we have seen, the inequality 

(12) 72 +74 + 76 < "1 + "3 + "5 + Pi + + 

is redundant, and follows in fact from (*) and the fact that the eigenvalues are 
listed in decreasing order. Indeed, let aov = a2 + + ae, and aod = 0:1+ 0:3 + as, 
and similarly for /3 and 7. The fact that each of a, (3, and 7 is nonincreasing 
implies the inequalities acv < ctod, Pev < Pod, and 7ov < 7od- But (*) says that 
7od + 7cv = aod + acv + Pod + Pev Hencc 7ev < aod + Pod, as asserted. 

In fact, for n = 6, this is the only triple on Horn's list that can be omitted 
(and the only triple that gives rise to an example like Example 1). As n increases, 
however, the number of redundant triples on the list increases rapidly. Describing 
the actual minimal set, however, requires notions from other areas of mathematics, 
and is postponed to later sections. 

As this example indicates, one reason why the inequalities defining the realiz- 
able triples (a. /?. 7) pose problems is because, together with equation (*) and the 
inequalities {*ijk), one also has the 3n — 3 inequalities ai > a2 > . . . > a„, 

/3l >/32 > ... >/3n, 71 >72 > ... >7n. 

There is also a literature describing the possibilities for a particular eigenvalue 
7fe (with k fixed) of C = ^ + B in terms of the eigenvalues of A and B. In fact, 
this problem was solved much earlier (see [Joh]). The result is that the k^^ largest 
eigenvalue of C can take on any value in an interval: 

(13) Max ai+Pj < -fk < Min at+Pj. 

i+j=n+k i+j=k+l 

We have seen in (2) and (11) that 7^ satisfies these inequalities, and since the set 
in question is a projection of a convex set in R" (or the image of the connected 
space U{n) by a continuous map) it must be an interval. It therefore suffices 
to produce Hermitian matrices to achieve each of the endpoints of the displayed 
interval. Explicit diagonal matrices can be produced to achieve this [Joh]. This 
argument also shows that this question has the same answer for real symmetric 
matrices. 

More generally, one can specify a subset K oi {1,. . . ,n}, and ask for the possible 
values of {7^ | k e K}, again with a and P given. In principle, such results may be 
deduced from Theorem 1, although carrying this out does not appear to be easy. 
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One always has the inequahties {*fgh) given for triples {F, G, H), for H a subset 
of K, and the duals (7) of inequalities {*fgh) when H contains the complement of 
K, together with the inequalities saying that the eigenvalues in K form a weakly 
decreasing sequence. But already for n = 3 and if = {1,3} it is easy to see that 
such inequalities do not suffice: they do not imply that the missing 72, which is 
determined, is at least as large as 73. 

Not all inequalities that have been found involving eigenvalues of sums of Her- 
mitian matrices are linear. For example, M. Fiedler [Fi] showed that 

n n n 

(14) Min TT(ai + < J^Ti < MaxTT(ai + 

There are still some basic questions remaining. One of these is the following: 

Which triples (/, J, K) of subsets of cardinality r in {1, . . . , n} give true 
inequalities {*ijk) for eigenvalues of all n by n Hermitian matrices A, 
B,C= A+B? 

Let us call this set iJ". This question was addressed by Horn [H2] and then by 
Zwahlen [Zw]. Theorem 1 asserts that C iJ", but how much larger is H^l 

For r < 2, the answer is quite simple, and was given by Horn and Zwahlen. The 
set Hi consists of triples with ii + ji < ki + 1. The set iJJ consists of triples such 
that h+ji < ki + l, 12+31 < fe + l, ii+j2 < fe + l, and ii+i2+ji+j2 < ki+k2+S. 
In fact, they show that for r < 2 and any triple not satisfying these conditions, there 
are diagonal matrices A, B, and C = A+B whose eigenvalues violate the inequality 
{*ijk)- They prove some partial results for r = 3. On the basis of the evidence in 
these papers, it was natural to hope that, if (/, J, K) is given to be in ?7", then it 
is in H'^ only if it is in T;*, i.e., Hi;^ n Ul^ = T^^. However, this is not true: 

Example 2. Take r = 5, n = 25, and / = J = {1,3,4,16,21} and K = 
{5,10,15,20,25}. This is in n U^. Indeed, it is easy to sec, by the same 
argument as in the proof of (12), that for any weakly decreasing n-tuples a, /?, and 
7 such that E 7i = E + E A, 

4 4 4 

^ 7fe = X! 75+5S < X] "1+5* + X] ^1+5* 
keK s=0 s=0 s=0 

iei jeJ 

To see that {I,J,K) is not in T^^, one can use the triple {F,G,H) in T|, with 
F = G ^ {1,2,4,5} and H = {2,3,4,5}: EfeF^f + EgeG jg = 82 > 80 = 
E/teH l^h + p{p + l)/2- Many other examples can be constructed by this method, 
but none of them are very small. 

Zwahlen [Zw] gave an example of a triple (/, J, K) for which he produced vio- 
lating matrices, but for which no diagonal matrices violate {*ijk)] this was, for 
r = 3, n = 18, / = J = {1, 6, 11}, K = {2, 9, 18}. Thompson and Therianos [TTl] 
gave the simpler triple ({1, 3, 5}, {1, 3, 5}, {2, 3, 6}) for r = 3, n = 6, with the same 
property. These triples, however, are not in U^. 
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Examples (Buch). The triple (/, J, if) = ({1, 3, 5, 6}, {1, 3, 5, 6}, {2, 3, 6, 9}) is in 
J/l \ rl , but no diagonal 9 by 9 Hermitian matrices can have eigenvalues violating 
{*ijk)- If A is the diagonal matrix with diagonal entries (2,0,1,2,0,1,0,0,0), 
and B is the direct sum of the matrix ^ ) the diagonal matrix with 

diagonal entries (2.1,1,0, 0, 0, 0), then the eigenvalues of A, B, and C = A + B are 
a = /3 = (2, 2, 1, 1, 0, 0, 0, 0, 0), and 7 = (3, 3, 3, 1, 1, 1, 0, 0, 0). These violate {*ijk)- 

These examples indicate some of the subtlety of identifying triples that give 
correct inequalities, and finding violating matrices for triples that do not. 

It is interesting to note that much of the extreme behavior can be detected by 
matrices that arc diagonal or close to diagonal, at least in low dimensions. Indeed, 
this must have been how many of the inequalities were discovered. However, we will 
see that the proofs of the theorems are almost opposite to this: the eigenvectors of 
the matrices produced by the proofs are in general position. 

The theorems of this section give a clear picture of the set of all triples (a, /?, 7) 
that occur as spectra of Hermitian n by n matrices A, B, and C = A + B. For small 
n they give a reasonable set of inequalities that one can use to test if a particular 
triple (a, /?, 7) occurs. For large n, however, the number of inequalities to be tested 
increases dramatically (even if decreased by using Theorem 13 below). When all of 
a, /?, and 7 arc integral, one can test directly if {a, (3, 7) occurs by using Theorem 11 
below. When the eigenvalues are rational, one can multiply them all by a common 
denominator to reduce to the integral case. We do not know a similarly direct 
criterion for arbitrary real eigenvalues. 

The necessary conditions of the theorems extend readily from the realm of finite 
dimensional Hermitian operators to that of compact self-adjoint operators on a 
Hilbcrt space. Indeed, it was in this context that Wcyl [W] stated his results, cf. 
[Zw]. Such an operator A has a sequence of positive eigenvalues ai > a2 > as > 
. . . (each occurring according to multiplicity), and a similar sequence of negative 
eigenvalues. For simplicity we consider only the positive eigenvalues: 

Theorem 6. Suppose A, B, and C ~ A + B are compact self-adjoint operators 

on a Hubert space, with a, (3, and 7 their sequences of positive eigenvalues, and 
assume that each of these sequences is infinite. Then {*ijk) holds for all (/, J, K) 
in T", for all n and r with r < n. 

The theorems here describe all possible eigenvalues of A + B when A and B are 

n by n Hermitian matrices with eigenvalues a and /?. In contrast, P. Bianc [Bi] has 
shown that, as n gets large, for almost all choices of A and B, the eigenvalues of 
A + B are close to some 7 that depends only on a and /3. 

Much of the history of the eigenvalue problem before the recent events, together 
with an extensive bibliography, can be found in the survey [DST]. 

2. Invariant factors 

We turn now to quite a different problem. Consider an n by n matrix A with 
coefficients in a discrete; valuation ring R, whose determinant is not zero. Let tt 
be a uniformizing parameter in R. The cases that have been most studied are 
when R = C{z} is the ring of convergent power series in one variable, with n = z, 
or when R = Zp is the ring of p-adic integers, with n = p. By elementary row 
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and column operations^, one can reduce A to a diagonal matrix, with diagonal 
entries 7r"i , n"^ , tt"" , for unique nonnegative integers ai > . . . > a„. We call 
a = (ai, . . . , an) the invariant factors (or Smith invariants) of A. The question 
in this case is: 

Which a, (3, 7 can be the invariant factors of matrices A, B, and C if 
C = A-B7 

In the case of convergent power series, this problem was proposed by I. Gohberg 
and M. A. Kaashoek, and was attacked particularly by Thompson and his coworkers 
(cf. [Thl], [Th2], [Th3], [Thi]). 

These matrices correspond to endomorphisms of i?", with cokernels being torsion 
modules with at most n generators. Such a module is isomorphic to a direct sum 

R/w'^m e R/w^m e . . . e R/'n°'^R, ai > . . . > a„. 

We call a = (ai, . . . , Q!„) the invariant factors of the module. Denoting cokernels 
of ^, B, and C by ^, B, and C, one has a short exact sequence 

i.e., jB is a submodule of C, with C/B = A. Conversely (cf. [Th3]), such an exact 
sequence corresponds to n by n matrices A, B, and C with C = A ■ B. This 
correspondence is seen easily by applying the "snake lemma" (see e.g. [La] §111.7) 
to the diagram 



Note that the equality (*), that X]7i = X]'^* + X] is obviously satisfied, since 
the determinant of a product is the product of the determinants. It is also not 
difficult to verify that inequality (1) must also be valid. Thompson [Thl] proved 
(6) in this setting. 

When R is the ring of p-adic integers, one is asking what finite abelian p-groups 
can appear in a short exact sequence. The aim of workers in this field was to use 
the numbers of submodules S of a given module C with specified invariant factors 
for B, C, and A = C/B, as the structure constants to define a ring. In fact, this idea 
goes back to a lecture of E. Steinitz in 1900, but this was lost until the 1980's. Prom 
the 1940's to the 1960's, this theory was developed by P. Hall, J. A. Green, and T. 
Klein. The conclusion of this study that is relevant to our question was that a triple 
{a, P, 7) occurs for p-groups, or in fact for any discrete valuation ring, if and only 
if a certain nonnegative integer c^p, called the Littlewood-Richardson coefficient, 
is nonzero. These coefficients, which have a purely combinatorial description, arose 
in representation theory, and will be defined and discussed in the next section. 

Although there seems to be little relation between this problem and the eigen- 
value problem, they in fact have exactly the same answers. In particular, this 
implies the fact that the answer to this problem is independent of the discrete 
valuation ring. 



^The elementary row operations arc: interchanging two rows; adding to any row a multiple by 
an element of R times another row; multiplying any row by an invertible element of R. Similarly 
for elementary column operations. 
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Theorem 7. For any discrete valuation ring R, a triple {a, j3, 7) occurs as the 
invariant factors of A, B, and C = A ■ B if and only if Ylli = X] + X] A o-nd 
the inequalities {*ijk) are satisfied for all {I, J, K) in T", for all r < n. 

In fact, these results can be extended to any principal ideal domain R, such 
as the integers Z or a polynomial ring F[T] in one variable over a field. In this 
case the invariant factors of a matrix A with nonzero determinant are a chain 
oi C a2 C . . . C a„ of nonzero ideals in R. The matrix A can be reduced by 
left and right multiplication by matrices in GL(n,R) to a diagonal matrix whose 
successive entries generate the ideals in the chain; equivalently, the cokernel of A is 
isomorphic to ®"=ii?/ai. 

Theorem 8. For any principal ideal domain R, a triple (a, b, c) of chains of ideals 
occurs as the invariant factors of A, B. and C = A - B (or of torsion modules 
A, B, and C with at most n generators with B d C and C/B = A) if and only if 
Yli = YliC'i ' Yli bi, and, for all r < n, and all {I, J, K) in T", we have 

keK i£i jeJ 

One can also consider matrices with entries in the quotient field of R, in which 
case the sequences a, f3, and 7 may contain negative integers, and the ideals o, b, 
and c may include fractional ideals. Theorems 7 and 8 extend immediately to these 
situations, by multiplying the matrices by scalars to get all entries in R. 

This theorem also gives a solution of a problem called the Carlson problem ([C], 
cf. [Th2], [SQS]). The general theorem is the foUowing: 

Theorem 9. Let A and B be p by p and q by q matrices with entries in a principal 
ideal domain R, with nonzero determinants, and with invariant factors oi C . . . C 
Op and 61 C . . . C 6q . Then the possible invariant factors for a matrix of the form 

(A X 
- ' B 



for X a p by q matrix with entries in R, are those Ci C . . . C c„, with n = p + q, 

1% 



for which Hl^i Ck = HLi " HLi "^'^ 



Jl Cfe D Oi JJ 6j 

i<P j<q 

for all {I, J, K) in T" and all r < n. 

An arbitrary s by s matrix A with entries in a field F has invariant factors 

ai C . . . C ftp in F[T]: these arc the invariant factors of the _F[r]-modulc F", 
where T acts hy T ■ v = Av for v € F" . The following theorem solves the original 
Carlson problem ([C]): 

Theorem 10. Let A and B be s by s and t by t matrices over a field F. Let 
ai C . . . C ttp and bi C . . . C bq be the invariant factors of A and B. Then the 

possible invariant factors of a matrix C = ^ ^ ^ ) j with X an s by t matrix with 
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entries in F, are those Ci C . . . C c„, with n = p + q, that satisfy the conditions of 
Theorem 9. 

For example, ii A = B = (^qq^j the invariant factors are ai = 61 = (T^), with 
p = q = 1. There arc three possible invariant factors of C: (T^) C (T^), which 
occurs with X = (T^) C (T), with X = and (T*) C (1), with 

X = Note that p and q are usually smaller than s and and the matrices 

A, B, and X have different meanings in Theorems 9 and 10. 

For an application, suppose A is a nilpotent matrix, with Jordan blocks of sizes 
ai > a2 > . . . > ap, and S = is a g by g matrix. We apply Theorem 10 with 
a, = bj = t^i with /3j = 1, and Ck = f''. In this case a rule of Fieri, given in 
equation (16) in the next section, implies that the possible C have Jordan blocks 
of sizes 71 > . . . > 7f , with p < t < p + q, Y,"fi = + p, + 1 > > ioi 
1 <i <p, and ji = 1 for p < i < t. This result has been used by C. R. Johnson and 
E. A. Schreiner [JS] to give a quick proof of a theorem of H. Flanders characterizing 
which pairs (C, D) of an m by m matrix C and an n by n matrix D have the form 
{A - B,B ■ A) for some m by n matrix A and some n by m matrix B. 

Unlike the situations we have seen previously, in Theorems 9 and 10 the matrices 
A and B, and not merely their invariant factors, can be specified arbitrarily in 
advance. 

The inequalities (13) also tell, for a fixed k between 1 and n, and fixed partitions 
a and /?, exactly which integers can be for some triple (a, /3, 7) that occur as 
invariant factors. As before, the necessity we know. To prove sufficiency it suffices 
to construct matrices that realize the possibilities. This has been done by J. F. 
Queiro and E. Marques de Sa [QS]. In light of the stronger results we now have, 
this raises the question of what can be said about other subsets. 

3. Highest weights 

An irreducible, finite-dimensional, holomorphic representation of GL„ (C) is char- 
acterized by its highest weight, which is a weakly decreasing sequence 

a = (ai > a2 > ■ ■ > a„) 

of integers. For example, the representation /\'^(C") corresponds to the sequence 
(1,1,... ,1,0,... ,0) consisting of k l"s and n — k O's, and the representation 
Sym'^(C") has highest weight (fc,0, . . . ,0). In general such a representation con- 
tains a nonzero vector v (called a highest weight vector), such that for any upper 
triangular matrix X in GL„(C), whose entries down the diagonal are xi,X2, ■ ■ ■ ,x„, 

X ■ V = x'i' x"^-" ■ ■ ■ x'^- V. 

We denote the irreducible representation with highest weight a by V{a). It is a basic 
fact of representation theory that GL„(C) is reductive. This means that any finite- 
dimensional holomorphic representation decomposes into a direct sum of irreducible 
representations, and the number of times that a given irreducible representation 
F(7) appears in the sum is independent of the choice of the decomposition. In 
particular, for any a and /3, the tensor product V{a) (g) V{(5) decomposes into a 
direct sum of representations 1^(7). Define c^jj to be the number of copies of ^(7) 
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in an irreducible decomposition of V{a) ^ V{(3). The problem of interest in this 
situation is: 

When does V{'y) occur in V{a) V{l3), i.e., when is cj^ > 0? 

It follows immediately from the definition of highest weights that a necessary con- 
dition for this is that X^7i = Qij + ^ Other conditions are more difficult to 
find, although an expert may attempt to prove some of the inequalities (1) - (7). 

A simple case of this problem is when /3 = (1,... ,1), so V{P) is the one- 
dimensional determinant representation. In this case V{a) (g) V{P) is equal to 
V{ai + 1, Q!2 + 1, • • • , Ctrl + !)• Ill particular, the problem is unchanged if each 
of the representations is tensored by this determinant representation several times. 
Therefore we may assume that each of a, /?, and 7 consists of nonnegative integers, 
i.e., is a partition. Equivalently, one need only consider polynomial representations 
of GLn{C). These representations were constructed in the nineteenth century by 
J. Deruyts [D]. For a simple construction of them, see [Fu2], §8.1. 

In 1934 Littlcwood and Richardson [LR] gave a remarkable combinatorial formula 
for the numbers cj^, although a complete proof of the formula was only given much 
later (cf. [Mac], [Fu2]). This formula is stated in terms of the Young (or Ferrers) 
diagrams corresponding to the partitions. The diagram of a is an array of boxes, 
lined up at the left, with ctj boxes in the i^^ row, with the rows arranged from top 
to bottom. For example. 



is the Young diagram of (5,4,3,1). We follow the convention, reinforced by the 
Young diagrams, of identifying two partitions if they differ by a string of zeros at 
the end. 

Their rule states that cj^^ is zero unless 7i = cti + Yl Pi ^.nd the Young 
diagram of a is contained in that for 7, i.e., < 7i for 1 < z < n. The complement 
of the Young diagram of a in that of 7, denoted 7x0;, then consists of ^ /3i boxes. 
We order the boxes of 7 \ a by first listing the boxes in the top row, from right to 
left, then the boxes in the second row from right to left, and so on down the array. 
The Littlewood-Richardson coefficient cj^ is the number of ways to fill the 
boxes of 7 \ a, with one integer in each box, so that the following four conditions 
arc satisfied: 

(i) The entries in any row are weakly increasing from left to right; 

(ii) The entries in each column are strictly increasing from top to bottom. 

(iii) The integer i occurs exactly fii times. 

(iv) For any p with 1 < p < ^ and any i with 1 < i < n, the number of 
times i occurs in the first p boxes of the ordering is at least as large as the 
number of times that z + 1 occurs in these first p boxes. 

(The last condition says that when the entries are listed, from right to left in rows, 
from the top row to the bottom, they form a lattice word.) 

For a = (3, 2, 1), /? = (3, 2, 2), and 7 = (5, 4, 3, 1), the following are some of the 
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ways to fill the boxes of 7 \ a satisfying the first three conditions. 
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The first three examples satisfy the fourth condition. The fourth example does not, 
since the first six boxes in the ordering have more 3's than 2's. One sees easily that 
the first three are the only possibilities satisfying all four conditions, so cj^ = 3. 

Two special cases of this rule were known to Picri in the context of Schubert 
calculus. Let a = (ai, . . . , a„). If /3 = {p), then the possible 7 for which cj^ ^ 
are those of the form (71, . . . , 7ti+i) with 

71 > ai > 72 > a2 > • • • > 7n >Oln > 7n+l > 0, 

with 7t = + p. 



(15) 



In these cases cj^ = 1. (For representations of GLn{C) only those with 7„+i = 
are allowed.) In terms of Young diagrams, (3 consists of a row of p boxes, and the 
diagram of 7 is obtained from that of a by adding p boxes, with no two in any 
column. 

The other Fieri rule is for /3 = (1, . . . , 1) consisting of p I's. The possible 7 with 
cj^ 7^ also have cj^ = 1, and these have the form 7 = (71, . . . ,7*) with 



(16) 



a, + 1 > 7i > a, for all i, and 



y 7i = y a. 



■p- 



Here /3 is a column of p boxes, and the diagram of 7 is obtained from that of a by 
adding p boxes, with no two in any row. 

Another case that is easy to analyze directly from the Littlewood-Richardson 
rule is the case when 7, = + /3j for all i. In this case one sees that cj^ = 1. 
These are the partitions which correspond to the triples of subsets listed in (6). 

Theorem 11. The Littlewood-Richardson coefficient c^^ is positive exactly when 
^ 7i = X] + S A ^''^'^ inequalities {*ijk) are valid for all {I, J, K) in T", 

and all r < n. 

Equivalently, in light of Theorem 1 , c is positive exactly when there is a triple 
of n by n Hermitian matrices A, B, C = A + B with eigenvalues a, /3, and 7. For 
example, if there are permutations cr and t in S„ such that (^^(i) + Prii) = 7i for 
1 < i < n, then cj^ must be positive, since there are diagonal matrices A, B, and 
C with these eigenvalues and C = A + B. For a combinatorial proof of this fact, 
which is a special case of the general (proven) "PRV conjecture," see [KT]. 

The first triple with cj'o greater than 1, i.e., with the smallest J2li^ ^ — 



(2,1,0), /? = (2,1,0), and 7 = (3,2, 
there are two arrays that satisfy (i) - 



1). In this case the coefficient is 2, because 
(iv): 
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Note that this triple corresponds to the triple ({1.3. 5}. {1. 3, 5}, {2, 4, 6}) of subsets, 
which appeared in examples in Section 1. The fact that this was the first triple to 
give a redundant Horn condition is not a coincidence. In fact, let us define i?" to 
be the set of triples (/, J, K) of subsets of cardinality r in {1, . . . , n} such that the 
corresponding triple {X,ijl,v) = {X{I) , \{J) , \[K)) has c^^ = 1: 

(17) K = {{I,J,K)eU:\c^^ = l}. 
Similarly, let 

(18) S^ = {{I,J,K)eU^\c^^^Q). 

The following theorem says that the triples occuring in Horn's conjecture are 
determined by the Littlewood-Richardson rule. 

Theorem 12. For all r <n, the sets 6"" and T" are equal. 

The sets of triples therefore have the following inclusions: 

(19) Ri! c s*;' = t;' c [/;'. 

Theorem 13. Given that 7i — ^^ i + Yl Pi > ""'^ ^-^■'^^t each of a, (3, and 7 formes 
a weakly decreasing sequence, the conditions {*ijk) for all {I,J,K) in T" and all 
r <n, are implied by the conditions {*ijk) for all {I, J, K) in i?" and all r <n. 

In the theorems involving inequalities {*ijk), Horn's sets T" can therefore be 

replaced by the smaller sets i?". (An indication of how much smaller these sets are 
can be found at the end of Section 9.) In fact, Knutson, Tao, and Woodward have 
announced [KTW] that the conditions {*ijk) for (/, J, K) in i?" are independent, 
i.e., none of them can be omitted. 

The Littlewood-Richardson coefficients also arise from the representation theory 
of finite symmetric groups. The complex irreducible representations of the sym- 
metric group Sa are indexed by partitions of a; for a partition a of a, let Va be 
this representation. If a is a partition of a, and /? is a partition of b, then the 
representation '8 is a representation of Sa x Sb- The Littlewood-Richardson 
number cj^ is the number of times Vy appears in the representation of Sa+t that 
is induced from K« ® Vg by the standard inclusion of Sa x Sb in Sa+b (cf. [Mac], 
§1.7 or [Fu2], §7.3). Theorems 11 and 12 therefore characterize the representations 
that occur in this decomposition. 

If a and /3 are given, the interval in (13) specifies exactly what can be the 
length 7fc of the A;*^ row of those 7 for which cj^ is not zero; this follows from 
the equivalent problem discussed in Section 2. Again there is the interesting open 
question of specifying the possible lengths {7^ | k € K} of a prescribed subset of 
rows of such 7. 

4. Schubert calculus 

Let X = Gr{n,C"^) be the Grassmann variety of n-dimensional subspaces L of 
C™. It is a complex manifold of complex dimension n{m — n). A complete flag 
F. is a chain = C J'l C F2 C . . . C J'™ = C" of subspaces of C", with 
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dim(_F'i) = i for all i. For any subset P = {pi < p2 < ■ ■ ■ < Pn} of cardinality n in 
{1, . . . , m}, there is a Schubert variety Qp{F.) in X defined by 

(20) = {LgX \ dim(L n Fp,) >i for 1 < i < n}. 

This is an irreducible closed subvariety of X of dimension — i). The homology 

classes ujp = [np{F,)] of these varieties are independent of choice of the flag, and 
they form a basis for the integral homology of X . For each partition a with 

m — n > ai > a2 > ■ ■ ■ > oin > 0, 

let |a| = X^o^ii ^^'^ define aa to be the cohomology class in whose cap 

product with the fundamental class of X is the homology class ojp, where P is 
defined by setting pi = m — n + i — Ui. Identifying cohomology and homology by 
this Poincare duality isomorphism, we have: 

(21) aa=uip, where at = m — n + i — pi, l<i<n. 

These classes aa form a Z-basis for the cohomology ring. It follows that for any 
such partitions a and /3, there is a unique expression 

for integers rf^-Jij' ^^'^ ''^^ o'^'^^ 7 with X] 7s = S + 5Z A- It is a conse- 
quence of the fact that GLm{C) acts transitively on X that all these coefficients 
are nonnegative. The problem in this context is: 

When does appear in the product aa- Uj}, i.e., when is the coefficient 
^ai3 positive? 

Algebraic geometers in the nineteenth century, especially Schubert. Fieri, and 
Giambclli, gave algorithms for computing in this cohomology ring (in spite of the 
fact that cohomology was not invented until many decades later). The Giambclli 
formula is a determinantal expression for any ctq in terms of the basic classes ak, 
1 < k < m — n, where cTfe = (T(fe), and (fc) denotes the partition (fc,0, . . . ,0). The 
(original) Fieri formula (see (15)) gives the product ak ■ ap. However, the classical 
geometers did not give a general closed formula for the coefficients d^^, nor did 
they give a criterion for these coefficients to be positive. 

Theorem 14. The class a^^ occurs in aa ■ ap exactly when X] 7i = X) ctj + A 
and the inequalities {*ijk) are valid for all {I,J,K) in T", and all r < n. 

This is stated for complex varieties and usual cohomology, but the same is true 
over any field, if one uses Chow groups instead of homology. 

For later use we state another of the fundamental facts about intersection theory 
on Grassmannians (cf. [Fu2], §9.3). Note that 

(22) jj2n(m-n)^ = 'L-ap, 

where p consists of the integer m— n repeated n times, i.e., ap = ,r} is the class 
of a point in X. We identiiy this top cohomology group with Z. The dual of a class 
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(T~f is the class a^i , where 7' — (m — n — j„,... ,to — n — 72,?n — n — 71) . This means 
that, for \S\ + I7I = n{m — n), the intersection number ■ ag in ij2n(m-n)js^ _ ^ 
is 1 if 5 = 7', and otherwise. Therefore, when J2li = X] "^j + X] A' t;he class 
occurs in aa ■ (T/3 exactly when the product <Ta ■ (T/s ■ a^i is not zero. In fact, 

(23) Oa- o^- a^i = d^^cFp- 

We will use an important fact that is part of this Schubert calculus. Suppose, 
for each 1 < s < fc, P{s) = {pi{s) < ... < p„(.s)} is a subset of {1,... ,m} of 
cardinality n, and F,{s) is any complete flag. Then the intersection of the Schu- 
bert varieties ns=i (s)(-^»('S)) must be nonempty if the corresponding product 
Yls=i '^P(.'s) i'' uot zero. This is a special case of a general fact in intersection theory, 
that the intersection of classes of varieties has a representative on the intersection 
of the varieties (cf. [Ful]). The converse is also true if the flags -F.(s) are in general 
position. That is, there is a dense open set (the complement of a closed algebraic 
subset in the product of k flag varieties) of fc-tuples of flags which are in general 
position. This is a special case of Kleiman's transversality theorem [Kle]. 

5. Singular values of sums and products 

We regard C" as the space of column vectors, and we use the standard Hermitian 
inner product, denoted {u, v) = ^"-i UiVl for u and v in C"; |u| denotes the square 
root of {u,u). 

An arbitrary real or complex to by n matrix A has singular values ai > 02 > 
. . . > ag > 0, where q is the minimum of to and n. They can be defined as follows. 
The largest singular value ai is the maximum of |(^ui,t;i)| as u\ varies over unit 
vectors in C", and vi varies over unit vectors in C". Choosing u\ and Vi to achieve 
this maximum, define 

02= Max \{Au2,V2)\. 

|li2 1 = 1, U2-LU1 
|j;2| = l, V2-i-Vl 

And so on for (13, . . . , a^. In fact, there is a unitary n by n matrix U , a unitary m. by 
TO matrix V, and an to by n matrix D that has ai, . . . , down the diagonal and is 
otherwise zero, such that A = V DU*. This presentation is called a singular value 
decomposition of A. The vectors tii and Vi can be taken as the z"^ columns of U 
and V respectively, for 1 <i < q. This decomposition is important in multivariate 
analysis, where the entry aij describe the j^^ property of an i*^ object; changing 
bases by U and V gives the best coordinates to distinguish correlations among the 
properties. It is also used in numerical algorithms. Geometrically, if A is real, 
the image ^4(5'""^) of the unit sphere is an ellipsoid in M™, and the lengths of its 
principal axes are twice the singular values of A. 

These singular values can also be determined in other ways. For example, if 
m < n, then a^, . . . , are the eigenvalues of the Hermitian matrix A A*; if n < m, 
one can similarly use A* A. In general, ±A and ±A* have the same singular values 
as A. A characterization we will use is that the eigenvalues of the to + n by to + n 
Hermitian matrix 

A' 
A* 

are ai>ai>...>ag>0>...>0> —aq > . . . > —02 > — ai, where the number 
of zeros in the middle is the difference between m and n. The problem we consider 
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in this setting is: 

What a, b, and c can be the singular values of m by n matrices A, B, 
and C, with C = A + B1 

Given a = {ai > . . . > Uq > 0), set 

a = (ai > . . . > a, > > . . . > > -a, > . . . > -ai), 

a nonincreasing sequence of length m + n, and define similarly sequences /3 and 7 
from b and c. For each triple (7, J,K) of subsets of {!,... ,m + n} of the same 
cardinality r < m + n, the inequality on a, (3, and 7 determines a corre- 

sponding inequality on a, 6, and c, that we denote by For example, if 

m = n = 2, r = 1, and / = {3}, J = {1}, K = {3}, the inequality 73 < as + /3i 
becomes the inequality — C2 < —02 + or 02 < C2 + 61. In general, setting 
/' = {i G {1, . . . ,m + n}\m + n + 1 — i G /}, and similarly for J and if, the 
inequality is 

{**ijk) m Cfe - ^ Cfc < - + - bj. 

keK kGK' iei iei' jeJ jeJ' 

k<g k<q i<g i<q j<q j<q 

Theorem 15. A triple a, b, c occurs as singular values of m by n complex matrices 
A, B, and C with C = A + B, if and only if the inequalities {**ijk) are satisfied 
for all {I, J, K) in T™+", for allr<m + n. 

Many special cases of the necessity of these conditions had been known, cf. [AM] , 
[TTl], [QS]. 

In contrast to the eigenvalue problem discussed in the first paragraph, the sit- 
uation for real matrices is not the same as that for complex matrices. For 1 by 1 
complex matrices, a triple of nonncgative numbers occurs whenever each is at most 
the sum of the other two; for real matrices, one must be equal to the sum of the 
other two. Here is a more interesting example: 

Example 4. For 3 by 3 matrices, the triple a = (1,1,0), b = (1,1,0), and c = 

(1, 1, 1) occurs as singular values of complex matrices A, B and C = A + B, but 
they do not occur for real matrices. In the complex case, A and B can be diagonal 
matrices with entries (1, C, 0) and (0, 1), with ^ a primitive 6*^ root of unity. It 
is a straightforward calculation to verify that they do not occur for real matrices. 

When one writes out the inequalities of Theorem 15, one finds that most of them 
are redundant, and that the essential inequalities remaining have quite a simple 
form. For example, for 4 by 4 matrices, the triple ({1, 3, 5}, {2, 3, 5}, {2, 4, 7}) in T§ 
gives the inequality C2 + C4 — C2 < ai + 03 — + b2 + b^ — b^. This inequality follows 
from the inequality C4 < 03 + 62 and the inequalities < ai and 64 < 63. The 
triple ({3, 7}, {2, 3}, {4, 8}) in T^, on the other hand, gives the inequality C4 — Ci < 
(13 - 02 + ^2 + 63, or 

(24) a2 + C4 < ci + 03 + 62 + bs, 

which is in fact essential: it determines one of the facets of the cone defined in 
Theorem 15. 
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For simplicity, let us consider only n by n matrices. For n = I, minimal in- 
equalities defining the possible singular values are: Ci < ai + 61, 61 < ai + ci, and 
«i < foi + ci. (In this case the inequalities oi > 0, 61 > 0, and ci > follow.) For 
n > 2, Buch has a precise conjecture for producing a minimal set of inequalities: 
they are the 3n inequalities 

(**) fli > 02 > . . . > a„ > 0, bi>b2> ...>b.n>0, ci > C2 > . . . > c„ > 0, 

and the inequalities {**ijk) coming from the following set of triples: The triple 
(/, J, K) must be in i?^", and the following two conditions must be satisfied: 

(i) None of /, J, or K contains a pair of the form {t, 2n + 1 — t} for any 
l<t<n. 

This means that there is no cancellation when one writes out the corresponding 

inequality {**ijk)- In particular, this requires r to be no larger than n. For a 
subset / of {1, . . . , 2n} of cardinality r, not containing any pair {t, 2n+l —t}, set 

I = {i e {1, . . . , n} I i e / or 2n + 1 - « e /} = {Ii < Z2 < • • ■ < v}, 
and define J and K similarly from J and K. The second condition is 

(ii) {P <r\kj, i K} = {p <rW i 1} W {p <r\~jp i J}, 

where ]J denotes disjoint union. This condition guarantees that when the inequality 
{**ijk) is written out as an inequality with all positive coefficients, as we did in 
the example (24), there are r terms on the left, and 2r terms on the right, and, for 
each position p between 1 and r, exactly one of the terms {a;j^,6j ,Cj } is on the 
left, and the other two are on the right. 

Conjecture (Buch). For n > 2, the inequalities (**) and {**ijk), for {I,J,K) 
inR^'^, r <n, satisfying (i) and (ii), define the facets of the cone of singular values 
for n by n matrices. 

Buch has verified this conjecture for n < 4. Independently, L. O'Shea and R. 
Sjamaar [OS] have speculated how a complete set of inequalities can be obtained 
by looking at Schubert calculus with Z/2Z coefficients in real flag varieties. 

In the case of square n by n matrices A, B, and C, one may also ask the similar 
question for products: 

What a, b, and c can be the singular values of A, B and C with C = A-Bl 

The answer again is controlled by the triples in T" (or i?"): 

Theorem 16. A triple {a,b,c) occurs as singular values of n by n matrices A, B, 
and C = A - B if and only if 

kGK iei jeJ 
for all {I, J, K) in T", and all r < n. 

In 1950 Gel'fand and Naimark [GN] proved the special case of this theorem when 
K = I and J = {1, . . . ,r}. Many other special cases have been found since then, 
cf. [TTl]. 

As in the Hermitian case (Theorem 6), these results also extend to singular values 
of compact operators on a Hilbert space. 

For the question of which C/., for fixed k, can be k*"^ singular values of sums 
or products of matrices with given singular values, the answers are similar to the 
Hermitian case discussed in Section 1. For these results, see [QS]. 
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6. First steps toward the proofs 

We now start on the proofs of the theorems, together with some generalizations 
and complements. In contrast to the first five sections, from now on we will prove 
the propositions as we go along, and include references to the original articles. 

6.1. The Rayleigh trace. An important key to the understanding of the eigen- 
value problem was given in 1962 by J. Hcrsch and B. P. Zwahlen, based on then 
recent work of A. R. Amir-Moez and Wielandt, with ideas going back to Poincare, 
Weyl, Courant, and Fischer. If A is a Hermitian n by n matrix, with eigenvalues a, 
choose corresponding orthogonal eigenvectors Vi so Avi = UiVi. Let Fk{A) be the 
span oi v\,. . . ,Vk, so we have a complete flag F.{A) of C" (determined uniquely 
by A only if its eigenvalues are distinct). For any linear subspace L of C", define 
the Rayleigh^ trace Ra{L) by 

r 

(25) Ra{L) ^Y.(Au,,Ui), 

i=l 

where mi, . . . ,Mr is an orthonormal basis of L. This is independent of the choice 
of basis; indeed, Ra{L) is the trace of the composite L ^ C" — » C" — > L, where 
the first map is the inclusion, the second is given by A, and the third is orthogonal 
projection. The key fact, which links the sums occurring in the inequalities for the 
eigenvalues to Schubert varieties, is: 

Proposition 1 [HZ]. For any subset I = {ii < . . . < ir} of {1, ... ,n} of cardi- 
nality r, 

'S^ai= Min Ra{L). 
^ Lenj{F.{A)) 

The proof is straightforward. For any L in the Schubert variety ^i{F.{A)), and 

for any unit vector ui in L H Fi-^ {A), we have {Aui, ui) > Ui-^ . For any unit vector 
U2 perpendicular to ui in L n Fi^{A), we have {Au2,U2) > ai^- Continuing in 
this way, one sees that Ra{L) = X]p=i(^"p' "p) — X]iG/^«" '^^^^ inequality will 
be strict unless these choices can be made with Aup ~ on^Up for 1 < p < r. In 
particular, taking L spanned by Up = Vi^ for 1 < p < r, one obtains equality. 
For later use we note the following consequence of this argument. 

Corollary 1. For L in fli{F.{A)), the only way the equality J^iei'^i — Ra{L) 

can hold is if L is spanned by eigenvectors ui,... ,Ur such that A(up) = oii^tip 
for 1 < p < r. In this case A maps L {and therefore also L^) to itself. If the 
eigenvalues ai, . . . , a„ are distinct, then there is only one L in Oj(F.(A)) for which 
Y.i^iai = RA{L). 

The following two corollaries are proved by the same methods, although they 

are not really needed here. 

Corollary 2. Let F'^{A) be the flag with Fl.{A) spanned by the last k eigenvectors 
Vn+i-k-, ■ ■ ■ 1 Vn, o,nd Set I' = {i\n + 1 — i & 1} . Then 

Va, = Max Ra{L). 
j^j Lenj,{FiiA)) 



^Lord J. W. Rayleigh, a British physicist, used the ratios (Av, v)/{v, v) in physical problems 
involving eigenvalues. For example, the maximum (resp. minimum) of this ratio for all nonzero v 
is the biggest (resp. smallest) eigenvalue of A. 
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To prove this, construct a sequence of orthonormal vectors Up in L (1 F-, (A). 
One has {A Up, Up) < ai^^^_^ for 1 < p < r. Taking L spanned by , . . . , Vi^ gives 

equahty. 

Corollary 3. Let Al : L ^ L denote the composite L ^ C" ^ C" ^ L defined 
before the proposition, and let > . . .> be its eigenvectors. Then 

ctn-r+k <a^<Oik for I < k <r. 

For this, let wi,... ,Wr be an orthonormal basis of L with Ai(wj) = afwi, 
1 < i < r. The intersection 

{Wi,... ,Wk)ri{vi,... ,Vk-l)-^ = {Wi,... ,Wk}ri{Vk,... ,Vn) 

is not empty, and if w is a unit vector in it, then aj^ < (Av, v) < a^. Taking a unit 
vector in {wk, ■ ■ ■ , Wr) D {vi, . . . , v„-r+k) gives the other inequality. 

When L is the subspace spanned by some of the basic vectors of C", the matrix 
for Al is a principal minor of A, and the result of Corollary 3 is sometimes called 
the inclusion principle. 

With a little Schubert calculus, Proposition 1 leads quickly to a proof of the 
following proposition. 

Proposition 2 ([J], [HR], [K], [T]). Let A, B, and C be Hermitian n by n 
matrices, with C = A + B, and eigenvalues a, j3, and 7. Then for any r < n and 
any {I, J, K) in 5", the inequality {*ijk) is valid. 

To prove this, note first that the Rayleigh trace is linear in the matrices, so 

(26) R-a{L) + R-b{L) + Rc{L) = Ra{L) = 0. 

Note also that if ai > . . . > q;„ are the eigenvalues of A, then a'^ = —an ^ 
... > a'^ = —ax are the eigenvalues of —A. For a subset / of {!,... ,n}, let 
I' = {i\n + 1 — i & I}. This gives the equations 

(27) ^a^ = -^a,. 

It follows from the definition of Schubert classes (see (21)) that 

(28) ax(i)=<^i', 

where the notation is that of Section 4, but working now in the Grassmannian 
Gr(r,C"). In addition, 

(29) '^Ki) ^'^^ ^W) '^^^ dual. 

This means that ■ <^x(j') = if and only if / = J, where p = {n — rY , so ap 
is the class of a point. The following are therefore equivalent: 

(i) crA(K) occurs in ax(i) ■ (Jx(j), i.e., {I,J,K) € S^; 

(ii) ax{i) ■ ax{j) ■ (Jx{k') 0; 

(iii) 10 r -LOji -hJKi^^- 
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This last condition implies that the intersection iljr{F.{—A)) fl (— B)) fl 

^k{F.{C)) is not empty. Taking L in this intersection, this gives 

- - E + E ^''fe ^ + + i?c(i) = 0, 

and this proves Proposition 2. 

Remark. Using the equation A + B+ (— C) = 0, one deduces by a similar argument 
that J2keK' Ik < J2iGi' + SjeJ' /^i' where I' = {i\n + 1 - i ^ I}. In fact, 
however, a triple (/, J, is in 5" if and only if (/', J', /f') is in S'"_^. This follows 
from the isomorphism between Gr(r, C") and Gr(n — r, C") that takes a subspace 
L of C" to the kernel of the map C" = (C")* — L* . On cohomology this map 
takes ax to cta' , where A' is the conjugate of A. Note that A(/') is the transpose of 
A (sec Lemma 4 in Section 10). 

In 1962, and for some time after that, people working on this problem were not 
familiar with Schubert calculus, and it was only in 1979, in an unpublished thesis of 
S. Johnson [J], directed by Thompson, that the connection with Schubert calculus 
was made. Proposition 2 was later rediscovered by U. Helmke and J. Rosenthal 
[HR], Totaro [T] (in a different context), and Klyachko [KU]. Many papers proved 
special cases of the inequalities {*ijk) by showing by hand — using only linear 
algebra — that the intersection of Schubert varieties appearing in the above proof 
must be nonempty. 

6.2. Relations among the four main problems. The fact that the representa- 
tion theory problem is equivalent to the Schubert calculus problem has been known 
at least since 1947, when L. Lesieur [Le] proved that the Littlewood-Richardson 
coefficicints c^^i^ are the same as the coefficients d^fj that describe the multiplica- 
tion in the cohomology of the Grassmannian. In spite of all the known relations 
between Schubert varieties and representation theory, however, the proof is not 
very direct. It proceeds by showing that both are controlled by the same algebra 
of symmetric functions. The character of the representation V{a) of GZ/„(C) is the 
Schur polynomial Sq(xi, . . . , Xn). Corresponding to the decomposition of a tensor 
product one has the identity Sa{x) ■ Sis{x) = J2 s-y{x). These Schur polynomials 
form a basis for the ring A„ of symmetric polynomials in the variables Xi, . . . ,Xn 
(cf. [Mac], [Fu2]). On the other hand, there is a surjection of the ring A„ onto the 
cohomology of the Grassmannian Gr(n, C"), that takes Sa{x) to the class aa- The 
classical formulas of Fieri and Giambelli imply that this is a ring homomorphism, 
which proves the following proposition (see e.g. [Fu2] for this story): 

Proposition 3. The coefficients for multiplying Schubert classes are the same as 
those for decomposing tensor products, i.e., dj^ = cj^. 

Closer relations are known between the cohomology of the Grassmann variety 
G/P and representations of G stemming from work of Ehresmann and Kostant (see 
[Ko] ,§8) , but one would like a more direct explanation for the equality of Proposition 
3. 

Based on the work of Hall and Green, in 1968 Klein [K] completed the proof of 
the following proposition: 

Proposition 4. If R is a discrete valuation ring, and C is an R-module with 
invariant factors 7, then there is a submodule BofC with invariant factors (3 such 



EIGENVALUES AND SCHUBERT CALCULUS 



23 



that A = C/B has invariant factors a if and only if the Littlewood-Richardson 

coefficient c^p is positive. 

Although this estabhshes a relation between invariant factors and representation 
theory, the proof does nothing of the sort. Rather, it provides a link between such 
submodules and the Littlewood-Richardson algorithm. In fact, if B is a submodulc 
of C, the invariant factors ^{k) of C/n^'B form a chain of partitions a = 7(0) C 
7(1) C . . . C 7(t) =7. If a A; is placed in each box of j{k) \ -f{k — 1), Green 
showed that one obtains an array whose conjugate — obtained by interchanging 
rows and columns — satisfies the conditions (i) - (iv) of Section 2 for the conjugate 
partitions a', /?', 7'. Conversely, Klein showed that any such chain of partitions 
can be realized by some submodule B oi C. To complete the proof, one needs the 
fact that c^, p, = c^p (sec Lemma 4 in Section 10). 

There are polynomials g^i^iT), called Hall polynomials, whose top coefficient 
is cj^, such that whenever the residue field of R is finite of cardinality q, then 
g^jjiq) is the number of submodules B of C with invariant factors (3, such that C/B 
has invariant factors a. For a proof, with references, applications, and history, see 
Chapter II of [Mac]. 

Thompson and his coworkers, who worked on both problems, noticed that the 
answers to the Hermitian eigenvalue problem and the invariant factor problem 
appeared to be similar to each other, and were therefore related to Littlewood- 
Richardson coefficients by means of Proposition 4. 

Although this was only realized in 1997 by A. P. Santana, J. F. Queiro, and 
E. Marques de Sa [SQS], the following relation can be deduced from a 1982 theorem 
of G. J. Heckman [He] about moment mappings and coadjoint orbits: 

Proposition 5. If a, 0, and 7 are partitions of lengths at most n such that cj^ ^ 
0, then there are Hermitian n by n matrices A, B, and C with eigenvalues a, (3, 
and 7 and C = A + B. 

In fact, we will not need this proposition; it will follow from the other results 
proved here. We will discuss the coadjoint orbit and moment map approach in 
Section 8. Note, however, that with these five propositions, relations are established 
among all four of our subjects. For partitions a, /?, and 7 of lengths at most n, 
consider the following conditions: 

(i) (a, /3, 7) occur as eigenvalues of n by n Hermitian matrices A, B, and C = 
A + B; 

(ii) (a, (3, 7) occur as invariant factors (over one or every discrete valuation ring) 
of n by n matrices A, B, and C = A ■ B; 

(iii) the representation ^(7) of G'L„(C) occurs in V{a) ® V{P); 

(iv) occurs in aa ■ (^0 in the cohomology of Gr{n, C™) for any large m. 

Propositions 3 and 4 say that (ii). (iii). and (iv) arc equivalent, and Proposition 5 
says that they imply (i). See [SQS] for more about these relations. 

Note that the Littlewood-Richardson coefficients measure a multiplicity in each 
of the problems (ii), (iii), and (iv). It would be interesting to find a similar inter- 
pretation for the eigenvalue problem (i).'^ 

^Knutson reports that there is an interpretation of a suitable volume of a space of Hermitian 
matrices {A, B, C) with C = A + B and eigenvalues a, fi, and 7, in terms of Littlewood-Richardson 
coefficients, but asymptotically, using triples {Na, N/3,Nj) for large N. See also [DRW], §3. 
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Proposition 2 established a link in one direetion between the eigenvalue problem 
for n by n matrices and Schubert calculus in smaller Grassmann varieties Gr(r, C"). 
The next main step is to find a converse to this proposition. It should be pointed 
out at this point that it is not obvious that any of the four problems have solutions 
that can be described by any inequalities of the form {*jjk)- The fact that the 
eigenvalue problem can be described by some linear inequalities also follows from 
convexity properties of moment maps, which we will discuss in Section 8. 

7. Filtered vector spaces, geometric invariant theory, and stability 

We start by describing a theorem of Totaro [T], which, although written for quite 
another purpose, quickly yields the converse to Proposition 2. I thank R. Lazarsfeld 
for pointing me to this work of Totaro. Let V be an n-dimensional complex vector 
space. By a filtration V of V we shall mean a weakly decreasing sequence of 
subspaces 

V = V° DV^ DV^ D ...DVP D l/f+l D ... 

with the assumption that = for sufficiently large q. Any subspace L of ^ gets 
an induced filtration, by setting = LflV^. 

By an m-filtration of V is meant a collection V'{s) of filtrations of V, for 
1 < s < TO. For any nonzero subspace L of V, define its slope /x(L) with respect 
to this TO-filtration by 

^ ' s=l p>l 

Call the TO-filtration semistable (resp. stable) if /Lt(L) < /u(F) (resp. /u(I/) < 
n{V)) for all subspaces LofV,O^L^V. There is an obvious notion of a direct 
sum of m-filtered vector spaces, taking the p*'' subspace of the sum to be the sum 
of the p*'^ subspaces of the factors. An TO-filtration is called polystable if it is a 
direct sum of a finite number of stable m-filtrations of the same slope. 

Proposition 6 [T]. An m-filtration on V is polystable if and only if there is a 

Hermitian metric on V such that the sum of the orthogonal projections from V 
onto the spaces y(s), for p > 1 and 1 < s < m, is the scalar operator /i(V). 

The proof of this uses some geometric invariant theory. Totaro shows that a 
polystable m-filtration corresponds to a polystable point x in a product X of m 
partial flag varieties, which has a canonical embedding in a projective space P(-E), 
where E = (g)p>i /^dimVC*) y_ jg ^ ^.^g^j^ q Kempf and L. Ness from 

Geometric Invariant Theory, that if x is a representative of x in E, polystability 
is equivalent to the corresponding orbit SL{V) ■ x being closed in E. One then 
chooses a Hermitian metric on V (which induces a metric on E) to minimize the 
distance from x to the origin. The fact that this is a critical point implies, by a 
calculation, the asserted fact that the sum of the projcx'tions is a scalar. 

This proposition was used by Totaro to give a simpler proof of a theorem of G. 
Faltings, that the tensor product of semistable filtrations is semistable. Essentially 
the same argument was found by Klyachko [KU] for the eigenvalue problem. See 
Section 8 and [Fu3] for more details. 

If a = (ai > a2 > . . . > q;„ > 0) is a partition of length at most n, and if 
F, : = Fq c Fi c ... C Fn = V is a complete flag in V, define a filtration V' of 
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V by setting — F^'^, whore a' is the conjugate partition to a, i.e., a'p is the 
cardinality of the set {i € {1, . . . ,n} \ at > p} . Any filtration arises in this way, for 
some unique a and some flag F., although the flag is not unique unless a consists 
of n distinct integers. The following lemma is immediate from this deflnition. 

Lemma 1. Choose a Hermitian metric on V , and let A he the sum, of the orthog- 
onal projections of V on V^, for p > 1. Then A is a Hermitian operator whose 
eigenvalues are ai, . . . ,an- 

For any subset I = {ii < ... < ir} of {1, . . . , n} of cardinality r, we have the 
Schubert cell ^^(F.), defined by 

n°j{F.) = {Le Gr{r, V) \ dim(L n Fj) = k for ik < j < ik+i, 0<k<r}, 

where ig is defined to be and ir+i is defined to be n + 1; in other words, / lists 
those i for which LCiFi LCiFi^i. This cell is a dense open subset of the Schubert 
variety Qj{F.). Any subspace belongs to a unique Schubert cell, when the fiag F, 
is fixed. Prom this definition it follows that, for any ai, . . . , an, 

n n 

(31) ^ ai = ^ a, dim(L n Fi/L n = ^(a^ - ai+i) dim(L n Fi), 

i£l i=l i=l 

where an+i is defined to be 0. By a simple counting argument, this implies: 

Lemma 2. Let L be a subspace of V that is in the Schubert cell Cl°j{F,). Then 

E. e7«^=Ep>idim(LnT/f). 

Wc define S^{m) to be the set of m-tuples I = (/(I),... ,I{m)) such that 
YTs^^i uji{s) 7^ 0. Here, as in Section 4, ujj denotes the class of a Schubert variety 
^i{F.). 

Consider now m-tuples a(l), . . . , a{m), with each a{s) a nonincreasing sequence 
of real numbers of length at most n, written a{s) = (ai(s), . . . ,a„(s)). For any 
m-tuple I = (/(I), . . . , I{Tn)) of subsets of {1, . . . , n} of cardinality r we have the 
corresponding inequality 

^ m ^ m, n 

-EE "^(^) ^ -EE«*(^)- 

s=l je/(s) s=l i=l 

Lemma 3. Let a(s) be a partition of length at most n, for 1 < s < m. Let F,(s) 
he general flags inV , 1 < s < m, and let V'{s) be the filtration defined by the flag 

F, {s) and the partition a{s), for 1 < s < m. This m-filtration is semistable (resp. 
stable) if and only if the inequality (*x) holds (resp. with strict inequality) for every 

X in S^{m) and all r < n. 

This follows from Lemma 2 and the fact that, when the F,{s) are general flags, 
an intersection of Schubert cells D^i ^/(s)(-^»(^)) is nonempty if and only if the 
corresponding class Jl^x ^Hs) is not zero. 

The following, in the Hermitian case, is one of the main results of Klyachko's 
paper [KU]. From now on, we will state the results for an arbitrary number of 
factors, rather than the three that were featured in the first section; and wc will 
put them all on the same side of the equation. This allows a simpler and more 
natural expression of the results and proofs, by avoiding the kind of manipulations 
that we saw in the proof of Proposition 2. 
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Proposition 7. Let Q:(l), . . . , a{'m) be weakly decreasing sequences of n real num- 
bers. There are Hermitian n by n matrices A(l),... ,A(m), with a{s) the eigen- 
values of A{s), and J27=i "^(*") " scalar matrix, if and only if (*x) holds for every 
1 in 5"(m), for all r <n. 

The implication is proved as in Proposition 2, but simpler: If A{s) = c, 
^^'^ Yls'^iis) 7^ 0' there is some subspace L in Q°j^^^{F.{s)). By Proposition 1, 

m m 

Yl (^iis)<Yl Ra{s){L) = Rc{L) =c-r, 

s=l iel{s) s=l 

and c = ^ Tra.ce{J2s M^)) = ^ J2s Tl7=i This proves that (*x) holds. 

For the converse, first one checks that the region defined by the inequalities (*x) 
and the inequalities that make each of a{s) weakly decreasing has a nonempty 
interior ([Fu3], Lemma 2). By a continuity argument, using the compactness of 
the unitary group U (n) , it suffices to prove the existence of the Hermitian matrices 
when each a{s) consists of n distinct rational numbers, and all of the inequalities 
are strict; therefore (since multiplying matrices by scalars or adding scalar 
matrices doesn't change the situation), we may assume that each a{s) consists of 
n distinct nonnegative integers. Take general flags F.{s), 1 < s < m, and use the 
partitions a{s) to construct filtrations V'{s). We have seen that this m-filtration 
is stable. By Proposition 6, there is a Hermitian metric on V so that the sum of 
the projections on the spaces Vp{s), p > 1, 1 < s < m, is a scalar. The conclusion 
follows from Lemma 1. 

Special cases of the following proposition were stated by V. B. Lidskii [LI] and 
proved by Horn [H2]. In fact, Horn used only calculus to show that, assuming 
that the eigenvalues are distinct, if one is on a boundary of the region of possible 
eigenvalues for real symmetric matrices, there must be an invariant subspace. This 
was an early step toward his conjecture. 

Proposition 8. Let A{s) be Hermitian n by n matrices, with eigenvalues a{s), for 
1 < s < m, whose sum A{s) is a scalar. Suppose that there is some I in S^{m) 
such that holds with equality. Then there is an r-dimensional subspace L of 

C" that is mapped to itself by each A{s). 

To prove this, take a flag F.{s) = F,{A{s)) corresponding to A{s) as in Proposi- 
tion 1. Let c = ^ J2s Si "1(5) be the scalar that the matrices A{s) sum to. Since 
T is in S^{m) there must be an r-dimensional subspace L of C" in Q.jf^g-^{F,{s)). 
By Proposition 1, 

\Y1Y^ ai(s)< J^i2A(.)(i) = J%,A(.)(i)=c. 

s iel(s) s 

Since holds with equality, we must have X]ie/(s) Q^iC'*) = for every s. 

By Corollary 1 to Proposition 1, each A[s) must map L to itself. 

Since each A(s) is Hermitian, it follows that it also maps L-^ to itself. The sum 
of the restrictions of A{s) to L (or L-*-) is the same scalar c, so the process can be 
repeated for these: either all the inequalities for them are strict, or they can be 
further decomposed. (This is essentially Totaro's proof [T] of Proposition 6 above.) 
Proposition 8 can also be proved from properties of moment maps, as in [Kn]. 
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Corollary. Suppose in addition that each A{s) is a real symrnetiie matrix, and 
Yls^i{s) '^''^ odd multiple o/ w^i,... ,r}; where r is the cardinality of each I{s). 
Then there is an r- dimensional subspace L o/R" that is preserved by each A{s). 

Indeed, the fact that Hs "^/(s) is an odd multiple of ,r} guarantees that the 
intersection f]^ 17/(5) (F.(s)) of real Schubert varieties must contain a real point. 
This is obvious if the flags are in general position, since the complex points occur 
in pairs. It is also true for arbitrary flags, by the results of [Ful], Chapter 13. 

This explains Example 1 in Section 1, which presents real symmetric matrices 
for which an equality is satisfied but for which there is no real subspace preserved 
by the matrices. In the present terminology, each J(s) = {2,4,6}, and ns^-f(s) = 
"^^{1,... ,r}] the corresponding intersection of three real Schubert varieties has two 
complex points, but no real points. See Sottile's article [So] for general results 
about real Schubert calculus. 

If each A{s) is real symmetric, if at least one has n distinct eigenvalues, and if 
some (*x) holds with equality, then there is a real subspace of dimension r preserved 
by each of the A{s). Indeed, the fact that the eigenvalues of at least one A{s) are 
distinct guarantees (by Proposition 10 below) that H^i '^/(s) = ^{i,...,r}j so the 
corollary applies. 

It is not hard to see that, in the list of inequalities (*x), those for which the sum 
of the codimcnsions of the i-^i(^s) is kss than r{n — r) follow from those where the 
sum of the codimensions is equal to r(n — r). The latter are the classes for which 
ris ^i(s) is a nonzero multiple of the class ,r} of a point. This is a consequence 
of Picri's formula for multiplying a Schubert class by the class (T(i) of a hyperplane. 
This also follows from the stronger results below. 

C. Woodward was the first to discover that this reduced list of inequalities (*x) 
is still redundant, by finding the example (12). These examples involve Schubert 
classes whose intersection number is greater than 1. P. Belkale conjectured and 
proved that all such inequalities must be redundant. To state his theorem, let 
i?"(m) be the set of m-tuples I of subsets of {1, . . . ,n} of cardinality r such that 

Us^Hs) = ^{l,...,r}- 

Proposition 9. On the space of m-tuples a(l),... ,a(m) of weakly decreasing 
sequences ofn real num,bers, the inequalities (*x) for all I in S^{m), and allr < n, 
are implied by those in R"{m) for all r <n. 

In fact, Knutson and Woodward proved that the extra inequalities are redundant 
in a particularly strong way, as we had observed in examples: 

Proposition 10. Let a(l), . . . , a(m) be weakly decreasing sequences ofn real num- 
bers, satisfying the inequalities (*x) fori in R^{m), all r < n. Suppose, for some 
r <n and some X in S"{m) \ R"{m), we have the reverse inequality 




Then this inequality is actually an equality. Moreover, for every s between 1 and 
m, at least one of the inequalities ai(s) > a2{s) > . . . > (Xn{s) must be an equality. 

The proofs of Belkale and Woodward follow an idea familiar in the study of 
stability of vector bundles: a maximal destabilizing subbundle with maximal slope 
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must be unique. As before, take general complete flags F,{s) in V = C". For 
any r-dimensional subspace L of V, there are unique subsets I{s) of {1, . . . ,n} 
of cardinality r such that L is in the intersection n^li ^/(s)(-^«('^))- ^(-^) = 
(7(1), . . . ,I{m)) be this m-tuple of subsets. Note that, since the flags are general, 
this intersection of Schubert cells is nonempty if and only if the corresponding 
product ^i{s) of Schubert classes is not zero. Define the slope of L, ii{L), by 

the formula 

m 

(32) M(i) = -^ ^ a,(s). 

s=l ie/(s) 

Suppose I is in 5;^(m) \ i?;?(m) and \ YJ^^i T.i(ii(s) > i E^li Er=i 
There must be some L in the intersection n^Li ^/(s)(^«("^))' ^.nd iJ,{L) > n{V) for 
any such L. Let n be the maximum of all slopes of all subspaces L, and take r 
maximal such that some L of dimension r has slope fi. Choose a subspace L of 
dimension r with /i(i) /i, and set X = T{L). Since ^i{L) > ^(V), I is not in 
i?"(m), so the intersection n^li ^/(s)(^»('*)) cardinality greater than 1. Let 
L' ^ Lhe another space in the same intersection. Consider the sum L + L', of some 
dimension r +t > r, and set /C = I{L + L'). The intersection LDL' has dimension 
r-t: set J = J{Ln L'). 

The linear algebra fact that, for all i and s, 

dim((L + L') n F,{s)) > dim(L n F,(s)) + dim(L' n Fi{s)) - dim((L n L') n Fi(s)), 

together with equation (31), implies that 

keK(s) iei(s) iG/(s) je./(,5) 

Since r • /i(L) = Es Eie/(s) ^^'^ similarly for the others, this implies the 
inequality 

(r + 1) • ii{L + L') > r- ti{L) + r ■ iJ,{L) -{r-t)- ii{L n L'). 

Since /x(L) = ^ and /x(LnL') < /z, the right side is at least 2r- iJL—{r—t)- n = {r+t)-ii. 
Hence ^(L + L') > fx, which violates the maximality of r. See [Gr] for more about 
maximal destabilizing subspaces. 

For the last assertion in Proposition 10, suppose that all the inequalities (*x) 
hold for I in SJ^{m), but that there is some I in SJ^{7v) \ R]]{m) for which equality 
holds. Suppose for some s that the eigenvalues ai(s),... ,an(s) are all distinct. 
We know that, for an appropriate metric on F = C", there are Hermitian operators 
A{s) whose sum is scalar and so that each of the general flags F, (s) is a flag F,{A{s)) 
corresponding to A{s). Since 2 is not in R"{m), there are at least two r-dimensional 
linear subspaces in Hrii ^i{s){P'{s))- But now for any L in n^^i ^i{s){F.{s)), by 
Proposition 1, 

8=1 iel{s) s=l 

^ ^ m n 

= -Ri:a{s){l) = -EE"'(*)- 

S=l j=l 
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Since the first and last terms are equal, we must have X^^gj^,) cKi(s) = Ra{s){L) for 
each s. But Corollary 1 to Proposition 1 implies that, if a{s) consists of n distinct 
eigenvalues, L is unique: there cannot be two or more such L in this intersection.* 

This argument applies also to the case where the intersection of the Schubert 
classes is not zero or a multiple of the class of a point, for in this case the corre- 
sponding intersection of Schubert varieties would be infinite. 

There is another fundamental fact from geometric invariant theory that Kly- 
achko uses to make the link between stability and highest weights. Namely, if C 
is the restriction to X of the standard line bundle 0(1) on the projective space 
F{E) = Gr{l,E), a point x is semistable exactly when some positive power 
has a section that is invariant by SL{V) and does not vanish at x. If each a{s) con- 
sists of nonnegative integers, the space of sections r{X,£'^^) is isomorphic to the 
dual of the tensor product ^'^iV{Na{s)) of representations with highest weights 
Na{s), where, for any n-tuple a = (ai,... ,Q!„), Na denotes (Nai,... ,7Vq;„). 
The bundle C'^^ has a nonzero 5'L(y)-invariant section if and only if the represen- 
tation (S'sV{Na{s)) contains the trivial representation of SL{V). The conditions 
(*x) imply that a general point x in X is semistable, and therefore ®sV{Na{s)) 
must contain this trivial representation. (For more detail see [Fu3]). This proves 

Proposition 11. Suppose a(l), . . . , a{s) are partitions of length at most n, and 
(*x) holds whenever Hs'^JCs) ~ '^{i>- -.f}- ^^^^n there is some positive integer N 
such that ®sy{Na{s)) contains the trivial representation of SL{V). 

8. COADJOINT ORBITS AND MOMENT MAPS 

The methods of coadjoint orbits and moment maps from symplectic geometry are 
powerful tools for studying many of the problems discussed here. A full discussion 
would go beyond the scope of this article. Here we sketch how this approach can be 
used to simplify and unify some of the results obtained in Section 7. An excellent 
accoimt of this method can be found in [Kn] . 

Let H be the space of Hermitian n by n matrices. For a a weakly decreasing 
sequence of n real numbers, let Oa C H he the set of Hermitian matrices with 
spectrum a. This is an orbit of the (compact) unitary group U{n) on H, acting 
by conjugation. The Lie algebra of U{'n) is the space u(n) of skew-symmetric 
matrices. Identify H with the (real) dual space u(n)* by the map which takes A in 
H to [B 1-^ —tr{iA ■ B)] in u(n)*. This identifies Oa with an orbit in u(n)*, i.e., 
with a coadjoint orbit. 

If each ai is integral, Oa can be identified with a partial flag variety Fl(a), which 
has subspaces of dimensions i for each i such that aj > a^+i. Corresponding to 
A in Oa one has the flag of sums of eigenspaces corresponding to ai, . . . , a^, for 
1 < i < n. As in Section 7, there is a natural embedding of Fl{a) in P{V{a)). 



*One can also prove this last part the same way as the first: If (*x) holds with equality, but 
there are two spaces L and L' in n^^i same argument shows that 

dim((L -I- L') n Fi{s)) = dim(L n Fi{s)) + dim(L' n Fi{s)) - dim((L n L') n Fi{s)) 

for any s and i such that aj(s) — > 0. If there is an s with all aj(s) distinct, the display 

must be an equality for all i. This implies, by an easy induction on i, that L n Fi{s) = L' n Fi{s) 
for all i, so L = L'. 
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Every coadjoint orbit has a natural symplcctic structure, as does every subman- 
ifold of projective space. It is a basic fact that these symplectic structures agree on 
Oa ^ Fl{a) C P{V{a)). 

Now suppose a{l), . . . , a{m) are given, each integral, with J2s Si Q;,(s) = 0. We 
have closed embeddings 

mm m 

^ = n Ocis) = n ^'("(*)) ^ n nviaism c p muviaism . 

5 = 1 5 = 1 5=1 

The diagonal action of U{n) on X is a Hamiltonion action, which determines a 
moment mapping from X to u(n)*. Identifying u(n)* with H, this moment mapping 

H : X^H 

is simply addition: . . . , A{m)) = A{1) + . . . + A{m). 

A basic result of symplectic geometry — inspired, interestingly, by A. Horn's 
other paper [HI] about eigenvalues of Hermitian matrices — is that the intersection 
of IJ,{X) with a Weyl chamber is a convex polytope. Here it says that the possible 
eigenvalues 7 = (71 > . . . > 7n) of a sum ^ A{s) form a convex polytope. 

In connection with the moment map fi, one has the symplectic reduction 
/Lt~^(0)/C/(n). This space is in fact homeomorphic to the quotient space of X by 
G = GL{n,C) constructed by geometric invariant theory ([MFK],§8): 

(33) fi-\0)/Uin) ^ X^yC ^ Proj(i?), 

where R = ®Rn, Rn = T{X,0{N))'^ = {(^V{Na{s))*f . From this one sees 
the equivalence of the following three conditions, which say that each of the three 
spaces in (33) is nonempty: 

(i) there are Hermitian A{1), . . . , A{m) with eigenvalues a{l), . . . , a{m) such 
that ^5 Ms) = 0; 

(ii) a general point (or some point) in X = Y\Fl{a{s)) is semistable; 

(iii) for infinitely many (or one) A'' > 1, (^sV{Na{s))'^ ^ 0. 

As in Section 7 (cf. [Fu3]), (ii) is equivalent to the condition that for general 
flags F.(l), . . . F.{m) and all subspaces L of C", 

m n 

dim(LnJ;(s)/LnFi_i(s)) < 0. 

5=1 i=l 

If I{s) = {i e {!,... ,n} I Ln F,{s) ^ L n F^^l{s)}, this says that L is in 
Since the flags arc general position, (ii) is equivalent to 

(ii') E5 Eie/(5) ^ whenever n5 '^Hs) + 0. 

The equivalence of (i), (ii'), and (iii) reproves Propositions 5, 7, and 11. For 
generalizations and variations, see [AW], [BS], [DRW], [He], [OS], [SQS], [Ta]. 

9. Saturation 

Proposition 11 is close to solving the highest weight problem, except for the 
serious possibility that the integer A'' that appears might be greater than 1. In 
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fact, for representations of some classical groups, such integers are necessary. For 
example, J. Stembridge points out that if w is the highest weight of the basic spin 
representation V{ui) of S'pmg(C), then V{u})(^V{uj) decomposes into four irreducible 
representations, while V{2u}) ^V{2u!) contains six irreducible representations of the 
form V{2ri) for some weights ry. Similarly in Schubert calculus, P. Pragacz points 
out that for the Lagrangian Grassmannian of 3-planes in 6-space, (7642 occurs in 
(T42 • 0-42. although (T321 does not occur in (T21 ■ 0-2.1. cf. [PR]. 

The key to this is provided by the following result of A. Knutson and T. Tao, 
which solves what is called the Saturation Problem. 

Proposition 12 [KT]. If a, /3, and 7 are a triple of partitions, and c^^^i^ ^ 0, 
then c^f^ 7^ 0. 

Their proof uses a wonderful new combinatorial description of these Littlewood- 
Richardson numbers as the number of geometric figures called "honeycombs" satis- 
fying some conditions. Their original proof also used another new description that 
they call "hives." Buch [Bu] has given a shorter version of their proof, entirely 
in the language of hives. For more about this Saturation Problem (before it was 
solved), see A. Zelevinsky's article [Z]. Recently H. Derksen and J. Weyman [DW] 
have given a different proof of Proposition 12, using representations of quivers. It 
is interesting to note that Johnson's unpublished thesis [J] contains a solution of 
the saturation problem when the partitions have lengths at most four. 

A. Postnikov and Zelevinsky have pointed out how this result is equivalent to 
one that looks stronger: 

Proposition 13. //a(l), . . . ,a(m) and j3 are partitions of length at most n, and 
N is a positive integer such that V{Nf3) occurs in the decomposition of ®sV{Na{s)) 
as representations ofGLn{C), then V{l3) occurs in the decomposition of^sV{a{s)). 

This follows from that fact that the number of times a representation V{(3) 
occurs in the decomposition of i8)sy(a(s)) is equal to a Littlewood-Richardson 
coefficient cj^, where a and 7 are obtained as follows. Arrange the Young diagrams 
of q:(1), a{2), . . . , a{m) corner to corner from lower left to upper right, so that the 
upper right corner of the diagram of a{i) just touches the lower left corner of 
a{i + l); these form a skew diagram 7 \ a. The reason for the equality is that both 
numbers are the numbers of skew tableaux on the shape 7x0: whose rectification 
is a given tableau of shape /? (cf. [Fu2], §5.1). Proposition 13 can also be proved 
directly, using the methods of [KT] or [DW]. 

Note that the corresponding result follows for representations of 5L„(C), since 
representations of 5'L„(C) differ from those of GL„(C) only by powers of the de- 
terminant representation /\" C" (cf. [FH], §15). 

Proposition 14. Let a{l), . . . , a{m) be partitions of lengths at most n, and sup- 
pose c = ^ X^^Li X] '^«('^) '^^ integer. Then the representation (E)^iV(a{s)) of 
GLniC) contains the representation (/\"C")®'^ if and only if (*x) holds for all 
I e S^{m) and all r <n. 

These conditions are equivalent to the condition that the Schur polynomial S(c»i) 
occurs in the product ■''«(«). 

As to geometric invariant theory itself, the ring ®]s[>or{X, jC'^^)^^'-^^ is the 
homogeneous coordinate ring of the GIT quotient variety X//SL{V) of X by SL{V); 
that is, X//SL{V) = Proj(ii). One may ask whether this graded ring R is generated 
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by its homogeneous part of degree 1. Unfortunately, this is not always true, even 
for these special varieties that are products of partial flag varieties: 

Example 5. For n = 3, let X be the Cartesian product of 6 copies of = 
and £ = 0(1, 1,1, 1,1,1) the tensor product of the pullbacks of the line bundles 
0(1) on the factors. The homogeneous part T{X, 0(2, 2, 2, 2, 2, 2))'5-^3(C) degree 
2 has dimension 16, while that of degree 1 has dimension 5, and 16 > (^^^)- 
Equivalently, if a = (5,4,3,2,1), /3 = (2,2,2), and 7 = (6,5,4,3,2,1), then cj^ = 

5, while 02^20 = 16 > ( "l^^)- Buch has verified by computer that the smallest 
AT • for which there is an a, /3, 7 and A'' for which Cj^^j^^ > C^""^^"^) is for 
N-J2lt = 30, with a = (4,3,2,1), /3 = (2,2,1), 7 = (5,4,3,2,1), and = 2, where 
again c^^ = 5, and 02^2/3 — ^'^^ ^ geometric description of {P^)^ //SL{3, C), see 
[DO], §VIL3. 

A. Okounkov speculates that the function f{N) = c^'*^^ may be log concave, 
i.e., that 

f{Nf>f{N-l)-f{N+l) 

for all N >1. 

For a general product X of partial flag varieties, is there a reasonable criterion 
to tell when this algebra is generated by its part in degree 1, or to find an N such 
that the algebra ©fe>or(X, /:«5feW)SL(y) generated by r(X, /:»^)^'^(^)? 

Knutson, Tao, and Woodward [KTW] have also announced a proof of a conjec- 
ture we had made that Cj^'*)y^ = 1 if and only if cj^ = 1. Buch [Bu] deduced 
from this that, for n > 3, if each of the partitions in {a, /?, 7) consists of n distinct 
integers, and if each of the inequalities {*ijk) holds strictly, then cj^ must be at 
least 2. 

10. Proofs of the theorems 

Most of the theorems in Sections 1-4 follow easily from the propositions proved in 
Sections 6, 7, and 9. Let us start by summarizing the main results, but generalized 
to an arbitrary number of factors. Alternative versions are stated in brackets. 

10.1. The main theorem. 

Theorem 17. Let a{l), . . . ,a{m) and 7 he weakly decreasing sequences of n real 
numbers, with X]r=i 7i ^ Sl'li Sr=i ^^ii^)- The following are equivalent: 

(1) EfceK Ik < J27=i J2iei(s) '^ii^) satisfied for all subsets /(I), . . . , I{m), K 
of the same cardinality r o/{l, ... , n} such that <J\{k) occurs in n^Li '^x{i{s)) 
[with coefficient 1] in H* (Gr(r,C"')), and all r < n. 

(2) a(l), . . . , a(m) and 7 are eigenvalues of Hermitian matrices A[l), . . . , A{m) 
and C, such that C = Y^^=\ ^(s)- 

If each o/a(l),... ,a(m) and') consists of integers, (1) and (2) are equivalent to 

(3) The representation ^(7) of GLn{C) occurs in <Si^iV{a{s)). 
If each of a(l), . . . , a(m) and 7 is a partition, these are equivalent to 

(4) For some [every] discrete valuation ring R, there exist n by n matrices 
A(l),... ,A{m) and C with invariant factors a(l),... ,a{m) and 7 such 
that C = A{1) ■ .. .■ A{m). 
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//, in addition, q is an integer at least as large as each ai(s) and 71, these are 

equivalent to 

(5) The class a-y occurs with positive coefficient in the product Y[T=i "'"(s) 
iJ*(Gr(n,C"+«)). 

We show how to deduce this theorem from the propositions in the preceding 
sections. As in the proof of Proposition 2, crA(x) occurs in YVILi o'a(/(s)) [with 
coefBcient 1] if and only if n^li ^i{s)' " is a nonzero multiple [equal to 1] of 
'-^{1,... ,r}- Let (3{s) = (— q:„(s), . . . , —ai{s)), for 1 < s < m, and let /3(m + 1)^7. 
The condition (2) is equivalent to the existence of Hermitian matrices B{s) with 
eigenvalues /3{s), 1 < s < m + 1, whose sum is zero (as one sees by setting A{s) = 
—B{s) and C — B{m+1)). From Proposition 7 [and Proposition 9], (2) is equivalent 
to the inequalities YlT=i J2iei{sy + J^keK. Ik < whenever Y[7=i '^Hs)' ■ '^k is 
a nonzero multiple [equal to 1] of w^i r}- This proves the equivalence of (1) and 
(2). 

Condition (3) is equivalent to (^^iV{a{i))* 0V{'j) containing the trivial repre- 
sentation of GLn{C), where V{a{s))* is the dual representation to V{a{s)), which 
has highest weight (— q:„(.s), . . . , — ai(s)). So the equivalence of (3) with (1) follows 
from Propositions 11 and 13. 

Condition (3) is equivalent to the assertion that the Schur polynomial Sj occurs 
in the product Jlg^x -^0(5) • The equivalence of this with (4) is scon by induction on 
m, the case m ^ 2 having been seen in Section 6. Indeed, occurs in Ils=i ^a(s) 
if and only if there is some (3 such that sp occurs in n^li^ Sa(s) occurs in 

Sfs ■ Sc«(m)- By induction, the first corresponds to a product A{1) • . . . • A{m — 1) = B 
with invariant factors a{l),... ,a{m — 1) and /3, and the second to a product 
B ■ A{m) = C with invariant factors /3, a(m), and 7. 

That (3) and (5) are equivalent follows as before from the fact that there is a 
homomorphism from the ring of symmetric polynomials onto the cohomology of 
the Grassmann manifold taking Sq, to ctq for all a. 

10.2. Horn's conjecture. We turn to the proof of Theorem 12, which implies 
Horn's conjecture. We start with a general combinatorial lemma. 

Lemma 4. (i) Let / = {«i < . . . < v} C {1, . . . , n}, and let X = {ir—r,... ,ii — 1) 
be the corresponding partition. Let I' = {i £ {1, • . • , n} | n + 1 — i ^ /}. Then the 
partition corresponding to I' is the conjugate partition X' to X. 

(a) IfX, n, and v are partitions, with conjugates A', and u' , then c^^ = Cy ^, . 

The first is an elementary combinatorial fact (cf. [Mac] §1(1.7)) that can be seen 
visually by identifying A with its Young diagram, which is traced out by a sequence 
of vertical steps going from lower left to upper right of the r by n — r rectangle 
of boxes. If A = A(/), then / specifics which of these n steps arc vertical, and its 
complement specifies which are horizontal, so /' specifies the vertical steps for A'. 
The second assertion is not obvious from the definition of Littlewood-Richardson 
coefficients, but follows from other descriptions, cf. [Fu2] §5.1. 

Now we prove that T" = S*" for all r < n, by induction on r. It follows from 
the definition that T'^ = 5^' = Let (/, J, K) be a triple in Ul^, with r > 1, and 
let (A, /U, u) be the corresponding triple of partitions, each of length at most r. The 
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condition for (7, J, K) to be in is that, for all p < r and all (F, G, H) e T^, 
I^V + I^is < ^fch+p(P + l)/2- 

feF gdG heH 

Since — a = A^+i-a, and similarly for J and K, this is equivalent to the inequality 

/eF geG ftei? 

The assumption that {I,J,K) is in [/" says that the triple (A, /x, satisfies (*), 
i.e., Aj + ^ /ij = ^ fj. The preceding inequality is therefore equivalent to 

/GF' 3GG' fteH' 

where _F' = {/ e {1, . . . , r} I r + 1 — / ^ F}, and G' and ff' defined similarly. 
By induction, we know that = 5^', and by Lemma 4 we know that {F, G, H) is 
in Sp if and only if {F',G',H') is in 5^_p. Hence the inequalities in question are 
equivalent to the inequalities 

Z < Z A/ + ^ /Zg 

heH feF geG 

for all g = r — p < r and all {F, G, H) in 5^. But by the equivalence of (3) and (5) 
of Theorem 17 (for the case m = 2), we know that the validity of these inequalities 
{*fgh) for all (F, G, H) in for all p < r is equivalent to the assertion that C)^^ 
is not zero. This means that (I, J, K) is in 5". 

The same idea proves the following version for an arbitrary number m of fac- 
tors, but in a symmetric form where all factors arc treated equally (so without a 
duality argument). Let C/"(rn) be the set of m-tuples /(I), . . . ,I{m) of subsets of 
cardinality r in {1, . . . , n} such that 

m 

(34) ^ ^ i = {m- l)r(n - r) + mr(r + l)/2. 

s=l ie/(s) 

Define S''^ {m) C U',}{rn) to be those m-tuples such that the product O'^/Cs) of the 
corresponding Schubert varieties is a nonzero multiple of the class of a point in 
Gr(r, C"), as in Section 6. Define T"(m) C C/"(m) inductively, to be the set such 
that, for all p < r and all = (F(l), . . . , F{m)) in T^{rn), 



(35) 



m 

i/(s) > (m — l)p(n — r) + TOp(p + l)/2. 

s=i /eF(s) 
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Theorem 18. S^{m) = T^{m) for all positive integers m, r, and n with r <n. 

This is easier to prove. For r = 1, Ti{m) = S'"(m) = Ui{m). For r > 1, 
let I = (/(I), . . . ,I{m)) be in U^{m), and define A(s) so that o'a(s) = i.e., 
ij{s) = n — r + j — Xj{s) for 1 < j < r. Then T is in T"(m) if and only if 
Z]/GF(s)(*/('^) - /) > ("^ — ^)p{n — r) for all T in T^{m) and all p < r. This is 
equivalent to the inequality E/eF(s) ^/(*) ^ P(n~r), or i E/6F(s) •^/(«) ^ 
n — r for all in T^{m) and all p < r. By Proposition 14, and the inductive fact 
that Tp{m) = Sp{m), this is equivalent to saying that Hs'^ACs) = dap for some 
d 0, where p = (n — rY . This says that T is in S^{m). 

By proving Theorems 17 and 18, we have completed the proofs of Theorems 1 
and 2 in their general forms for an arbitrary number of factors. Theorem 5, also 
for any number of factors, follows similarly from Proposition 8. 

10.3. Compact operators. To prove Theorem 6, for any number of compact 
self-adjoint operators A{1),... ,A{m) and C = J2T=i^i^) ^ Hilbert space H, 
we want to show that all the inequalities of (1) of Theorem 17 are valid. Choose a 
finite-dimensional subspace V oi H that contains eigenvectors corresponding to the 
n largest eigenvalues of each of the operators under consideration. It then suffices 
to apply the finite-dimensional result to the operators A{l)v, • • • , A{m)v, and Cy; 
here, for any operator A, Ay denotes the composite V ^ H ^ H ^ V , where 
the first map is the inclusion, the second A, and the third is orthogonal projection. 
The same argument as in Corollary 2 to Proposition 1 shows that the k*'^ largest 
eigenvalue of Ay is no larger than the fc*^ largest eigenvalue of A. The operators 
A{s)v and Cy will therefore have the same n largest eigenvalues as A{s) and C. 
For more on eigenvalues of compact operators, see [W], [Zw], [Fr], and [R]. 

10.4. Invariant factors. We have seen the general version of Theorem 7 in 

Theorem 17. Theorem 8 follows from Theorem 7 by localization. This is easiest 
to see in the version about short exact sequences of modules of finite length. To 
specify such an exact sequence over a principal ideal domain R is equivalent to 

specifying a short exact sequence of modules of finite length over a finite number 
of localizations Rp of _R, each of which is a discrete valuation ring. 

Theorem 9 follows easily from Theorem 8. Indeed, given C (i.e., X), there is an 
exact sequence 

where A, B, and C are the cokernels of A, B, and C, so the necessity follows 
from Theorem 8. Conversely, given A and B with invariants ai C . . . C Op and 
bi C ... C bq, and possible invariants ci C . . . C Cn satisfying the conditions 
displayed in Theorem 9, Theorem 8 implies that there is such an exact sequence, 
with A and B the cokernels of A and B. The standard argument used to resolve a 
short exact sequence of modules by a long exact sequence of free modules says that 
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the dotted arrows in the following diagram can be filled in: 



RP 



I 



^Ri 



A 



B 



RP ^ RP e R" -^—^ Ri 



Y 



A 



Here i and j are the canonical embeddings and projections: i{v) = {v, 0), j{v, w) = 
w. This produces a matrix C of the required form with cokcrncl C. 

To deduce Theorem 10 from Theorem 9, we use the standard fact that an en- 
domorphism of a finite dimensional vector space over a field F, up to similarity, is 
the same as a finitely generated torsion module over R = F[T], up to isomorphism. 
The invariant factors are the same for the endomorphism and the i?-module, and 
they determine the endomorphism up to similarity. 

10.5. Highest weights. Theorems 11 and 14 are special cases of Theorem 17 

together with Theorem 12. Theorem 13 follows from Proposition 9, together with 
the fact that (/, J, K) is in (resp. i?;?) if and only if (/', J', K) is in S';?(3) (resp. 
i?P(3)). Here R!^{m) is defined to be the subset of S'!^{m) such that the product 
n W7(s) is equal to the class of a point. 

10.6. Singular values. We next prove the theorems about singular values. For 
the additive version in Theorem 15, the fact that all the inequalities {**ijk) must 
be satisfied follows immediately from Theorem 1, since if C = A + S is a sum of 
arbitrary matrices, then 



is a sum of Hermitian matrices. (In fact, if one uses the set 5" instead of the set 

T", the necessity follows from the simple Proposition 2.) The full theorem was 
recently proved by O'Shea and Sjamaar [OS], §9, (also in terms of the sets S*"), 
using moment maps, although they do not mention singular values. Unlike the 
other theorems described here, we do not know another proof for this converse. 
The same arguments prove the generalization of Theorem 15 to any number of 
factors. 

The easy half of the multiplicative version (Theorem 16) can be proved by 
using a variation of the Hersch-Zwahlen lemma, which is based on an idea of 
Amir-Moez [AM], cf. [TTl]. First we require a multiplicative analogue of the 
Rayleigh trace. Suppose A is a positive semidefinite Hermitian n by n matrix. Let 
cti > . . . > a„ > be the eigenvalues of A, and let wi, . . . , be corresponding 
orthonormal eigenvectors. Let F.{A) be the flag with Fk{A) spanned by vi, . . . ,Vk- 
If L C C" is an r-dimensional subspace, define Da{L) to be the determinant of the 
matrix {Aui,Uj)i<ij<r, where Ui,... ,Ur is an orthonormal basis of L. Equiva- 
lently, Da{L) is the determinant of the mapping Al of Corollary 3 to Proposition 
1. 
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Proposition 15. For any subset / C {1, . . . ,n} of cardinality r, 

T]ai= Mm Da{L). 

To prove this, write Ui = ^ikVk- Then (Aui^Uj) is the i,j entry of the 

matrix X D X*, where X is the r by n matrix (xij), and D is the diagonal matrix 
with diagonal entries cti, . . . , Q!„. Therefore 

DAiL) = Y^detiX-') Y[ar det(X}) = ^ | det(X^)|2 • [] ^J, 
J j&j J j&j 

where the sum is over all subsets J of {1, ... ,n} of cardinality r, X'^ is the sub- 
matrix of X with columns labeled by J, and Xj is the submatrix of X* with rows 

labeled by J. Take Up in L n Fi^ successively as in the proof of Proposition 1. The 
p^^ row of the matrix X ends in column ip, so det(X'^) = unless jp < ip for all 
p. For such J, Uj^jo^i > Dje/^*- Moreover, Ej I det(X"')|2 > det(XX*) = 1. 
These inequalities imply that Da{L) > Hie / '^i- before, equality is achieved by 
taking L to be spanned by those vectors Vi for i in /. 

Proposition 16. Let C = A(l) ■ . . . ■ A(rn) be a product of m arbitrary n by n 
matrices. Suppose Li, . . . ,Lm are r- dimensional subspaces of with A{s){Ls) C 
Lg-i for 2 < s < m. Then 

rn 

Dc*c{Lm) = Y\_F)a(sYA(s){Ls)- 
s=l 

We first do the case where m = 2. Suppose C = A ■ B, and L and M are 
r-dimensional linear subspaces with B{M) C L. Wc must show that Dc*c{M) = 
Da*a{L) ■ D B* b{M) . Let Ui, . . . , be an orthonormal basis for L, and vi, . . . ,Vr 
an orthonormal basis for M. Write B{vi) = ^ XijUj, and let X = {xij). Then 

Dc'c{M) = det{{B*A*ABv^,Vjj) = det{{A* ABv^, Bvjj) 

= det(X) • detiiA*Aui,Uj)) ■ det(X*) = det^XX*) ■ detHAu^, Auj)) 
= detiXX*)-DA'A{L). 

But Db'b{M) = dQt{{Bvi, Bvj)) = dct{J2 XikXjiiukjUi)) — det{XX*), which 
proves the claim. The general case is proved from this by induction, setting 
A = A{1) ■ A{m - 1), B = A{m), M = L^, and L = Lm-i- 

Theorem 19. LetA{l),... ,A{m) ben by n matrices such that A{1)-. . .■A{m) = 1. 
Let ai(s) > ... > a„(s) > be the singular values of A{s), 1 < s < m. Sup- 
pose /(I), . . . , lim) are subsets of cardinality r of {1,. . . ,n} such that the product 
n ^/(s) of the corresponding Schubert classes is not zero. Then 

m 

n n «^(^) ^ 1 

s=l ie7(s) 

Note that ai(s)^ > • • • > an(s)^ are the eigenvalues of A{s)*A{s). To prove the 
theorem, take flags F,{s) corresponding to orthonormal bases of eigenvectors for 
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A{s). By Proposition 15, £'A(s)-A(s)(is) > Ilje/Cs) »»(«)^ ^« in n/(g)(j;(s)). 
By the hypothesis on the nonvanishing of the product of Schubert classes, we may 
find a subspace L in the intersection 

m— 1 

ni^m){F.{m)) n n ^nsMis + 1) ■ . . . • A{m))-\F.{s)). 

Set Ls = A{s + !)•...• A{m){L) for 1 < s < m. Then A{s){Ls) C Lg-i, and is 
in f2/(g)(i^.(s)). By the two preceding propositions, 

m m 

1 = D,{L) = n DAi,yAis){Ls) > n n "^(^)'- 

s=l s=lie7(s) 

Corollary. Lei A{1), . . . , A{m) be n by n matrices, and let C = A{1) ■ . . . ■ A{m). 
Let a{s) be the singular values of A{s), and c the singular values of C. Then for 
any /(I), . . . , I{m) and K subsets of {!,. . . , n} of cardinality r, such that (^\{k) 
appears in n^li ^a(/(s)), 

m 

n < n n 

keK s=i,e/(s) 

The proof is similar to that used in the additive case for oigonvahics: By a 
continuity argument, as in the proof of Proposition 7, it suffices to consider the 
case where all the matrices are invertible. If oi > . . . > a„ > are the singular 
values of A, then l/a„ > ... > l/oi arc the singular values of A^^ . Apply the 
theorem to C ■ A(m)-^ • . . . • A{1)-^ = 1, using the sets K and I{m)' , . . . ,/(!)', 
where I' denotes {n + 1 — « | « S /}, exactly as in the proof of Theorem 17. 

In another remarkable paper [K12], Klyachko has recently proved the converse, 
and the corollary itself, which specializes to Theorem 16 for m = 2. He deduces this 
from a computation of probability densities for random walks on the Lie algebra 
su(n) and the homogeneous space SL{n,C)/SU{n). Such densities for Euclidean 
3-space were computed by Lord Rayleigh. 

10.7. Real symmetric and Hermitian quaternionic matrices. We prove 
the general versions of Theorems 3 and 4, Proposition 7, and Theorem 17 for real 
symmetric and Hermitian (self-conjugate) quaternionic matrices. 

Theorem 20. The eigenvalues a{l), . . . , a{m), and 7 that occur for complex Her- 
mitian matrices A{1), . . . , A{m) and C = A{1) + . . . + A{m) are the same as the 
eigenvalues that occur for real symmetric or for Hermitian quaternionic matrices. 

For real symmetric matrices, it suffices to show that Proposition 7 is true for 
real symmetric matrices. The argument is similar to the complex case, rising the 
fact that if V is the complexification of a real vector space, then flags that are 
complexifications of real flags are Zariski dense in all flags. Prom this it follows 
that the Hermitian matrices constructed in Proposition 7 can all be taken to be 
real symmetric matrices. (See [Fu3] for details.) 

A Hermitian quaternionic matrix is an n by n matrix A with entries in the 
quaternions H which is equal to the transpose of its conjugate. Such a matrix acts 
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on the left on the (right) H-space H" of column vectors. Such a matrix again has all 
its eigenvalues real, and H" has an orthonormal basis of eigenvectors. In this case, 
one needs only prove that all the conditions (*x) are necessary. This follows exactly 
as in the proofs of Propositions 2 and 7, together with the observation that Schubert 
calculus for the quaternionic Grassmannian Gr{r, H") of (right) subspoaces of H" is 
exactly the same (but with all degrees doubled) as that for complex Grassmannians 
Gr(r,C"). This is proved in [PR]. 

Steinberg asks if one can characterize the eigenvalues of real skew-symmetric 
matrices A, B, and C = A + B. For any real matrix A, the matrix ^ _'^t o ) 
skew-symmetric, so this may be related to the problem of characterizing singular 
values of real matrices A, B, and C = A + B. 

11. Final remarks 

It would be interesting to find more direct relations between the subjects ad- 
dressed here, that would give better explanations of why the questions in each 
subject have the same answers. It follows from the main theorems in this article 
that in each of these subjects, the question of which triples {a, /?, 7) occur for a 
given n is completely determined by knowing the answer to the same question for 
all r < n. It is a challenge to find direct explanations for this in any of these 
situations. For example, is there a representation-theoretic explanation of the fact 
that knowing which irreducible representations of GL„(C) occur in tensor products 
is determined by the answer to this same question for GLr (C) for r < nl A similar 
question can be asked about the Schubert calculus for Gr(n, C"+^) being deter- 
mined by the Schubert calculus for all Gr{r, C") for r < n (at least the part about 
which Schubert classes appear in a product). One has a similar mystery for the 
invariant factors problems. For the eigenvalue problem, Theorem 5 and Proposition 
8 at least produce some smaller matrices. 

Agnihotri and Woodward [AW] and Belkale [Be] have proved an analogue of 
the Klyachko theorem for unitary matrices. They characterize the possible eigen- 
values of imitary matrices ^(1), • • • ,A{m) whose product is the identity. Instead 
of being controlled by Schubert calculus as in the Hermitian eigenvalue problem, 
it is controlled by quantum Schubert calculus. Although there are algorithms for 
compiiting in this quantum Schubert calculus, there are not yet true analogues of 
the Littlewood-Richardson rule, nor are there useful criteria for the nonvanishing 
of such products. 

One would like to see good analogues of these tlicxjrcnns for otlicir Lie groups. 
We know that at least the saturation problem must be modified, but it seems 
reasonable to hope for sharp analogues of many of the theorems, at least for the 
other classical groiips. The best results along these lines so far have been achieved 
by use of moment maps and coadjoint orbits, cf. [AW], [BS], [He], [OS]. Knutson 
and Tau end their paper [KT] with a precise conjecture. 

One of the relations that comes from this work is that between representations 
of the symmetric group and Hermitian eigenvalues. There is a deep relation, con- 
jectured by Baik, Deift, and Johansson [BDJ], and proved by Okounkov [O]. Any 
permutation w in 5'„, if written as a sequence of its values Wi, . . . , w„, determines 
a partition \{w) of n: X\{w) is the length of the longest increasing subsequence of 
w, Ai {w) + A2 {w) is the maximum sum of the lengths of the two disjoint increas- 
ing subsequences, and so on. The fact is that the distribution of these Afe(w), is 
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exactly the same as the distribution of the eigenvalues of a random Hermitian n by 
n matrix, suitably normalized, as n goes to infinity. Does this have any connection 
with the theorems presented here? 

It may be worth pointing out explicitly that although the problems solved in this 
story range over several areas of mathematics — including linear algebra, commuta- 
tive algebra, representation theory, intersection theory, and combinatorics — none 
of the people involved in the recent success came to the problems from any of these 
fields. Klyachko came from studying vector bundles on toric varieties, Totaro from 
studying filtered vector spaces using geometric invariant theory, Knutson, Agni- 
hotri and Woodward came from symplectic geometry, Tao from harmonic analysis, 
and Belkale from the study of local systems on Riemann surfaces. Klyachko's ar- 
ticle [KU] discusses some other interesting topics, such as spaces of polygons and 
Hermitian-Einstein metrics, that are related to the eigenvalue problem. 
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